RECEPTOR FINGERPRINTING, SENSORY PERCEPTION, AND
BIOSENSORS OF CHEMICAL SENSANTS

Cross Reference to Related Application

This application is related to U.S. Provisional Serial No. 60/213,812, filed 
June 22, 2000, and U.S. Serial No. 09/804,291, filed March 13, 2001, which are 
incorporated by reference in their entirety. 

Background of the Invention

Field of the Invention 

The invention relates to the use of sensory G protein-coupled receptor
complexes that recognize chemical sensants, particularly those involving olfactory and
taste receptors; polypeptide fragments and mutants thereof; classes of such receptors;
polynucleotides encoding such receptors, fragments and mutants thereof, and
representatives of receptor classes; genetic vectors including such polynucleotides;
and cells and non-human organisms engineered to express such receptor complexes,
fragments and mutants of an olfactory or taste receptor, and representatives of receptor
classes to simulate sensory perception of odorants and tastants. The invention also
relates to the use of such products as a biosensor or component thereof to detect, to
identify, to measure, or otherwise process the event of binding between the receptor
and its cognate ligand (i.e., chemical sensant). The invention has application, for
example, in the design and formulation of odorant and tastant compositions.

Description of the Related Art

The olfactory and taste systems provide sensory information about the
chemical environment. Olfactory receptors and taste receptors recognize,
respectively, "odorants" and "tastants," collectively referred to as "sensants" or
"sensory receptor ligands" herein. A "primary" sensant is an odorant or tastant ligand
that substantially binds to sensory receptors with a ligand-binding site of a single
amino acid sequence. Olfactory and taste receptors belong to the superfamily of
seven-transmembrane guanyl nucleotide-binding protein-coupled receptors; such
receptors are, however, also recognized as distinct families, or sub-genera, of olfactory
or taste receptors (see Raming, Nature 361:353, 1993). These receptors control diverse
physiological functions such as mediating signaling from an external chemical
stimulus across the membrane containing the receptor into a cell, endocrine function,
exocrine function, heart rate, lipolysis, and carbohydrate metabolism. Thus, the
dissection of these diverse functions into component signals is needed.

But the complexities of sensory perception of chemical sensants prevent easy 
translation of the olfaction and taste systems to a machine sensor. For example, U.S. 
Patent Nos. 5,675,070; 5,918,257; 5,928,609; and 6,085,576 disclose machine sensors 
that use various chemistries, but they do not take advantage of the specificity of 
olfactory and taste receptors for their cognate ligands to produce a biosensor.
WO 00/15269 discloses methods and apparatus for odor reproduction. The
total affinities of a specific odorant with a group of receptors were called the affinity
fingerprint of the odorant. This odorant fingerprint was represented by a vector of
affinity values. It was proposed to reproduce an arbitrary odor by inputting its sensed
odorant fingerprint into a device, which has a palette of predetermined odorants and
produces a composite odor using predetermined odorant fingerprints by minimizing
the difference between vectors representing the sensed odorant fingerprint and the
predetermined odorant fingerprints. This reference, however, does not teach or
suggest the human olfactory receptors disclosed herein.
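
The fingerprint-matching idea summarized above may be sketched as follows. This is an illustrative sketch only and is not taken from WO 00/15269: the odorant names, the three-receptor affinity values, the assumption that fingerprints of mixed odorants add linearly, and the use of projected gradient descent to minimize the squared difference are all assumptions made for the example.

```python
# Illustrative sketch only: odorant names and affinity values are hypothetical,
# and linear additivity of affinities in a mixture is an assumption.

def mix_fingerprint(palette, weights):
    """Return the fingerprint of a mixture as a weighted sum of palette vectors."""
    length = len(next(iter(palette.values())))
    return [
        sum(weights[name] * vector[i] for name, vector in palette.items())
        for i in range(length)
    ]


def squared_difference(u, v):
    """Squared Euclidean distance between two fingerprint vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def fit_mixture(palette, target, steps=5000, rate=0.05):
    """Projected gradient descent: minimize the squared difference, weights >= 0."""
    weights = {name: 0.0 for name in palette}
    for _ in range(steps):
        mixed = mix_fingerprint(palette, weights)
        residual = [m - t for m, t in zip(mixed, target)]
        gradient = {
            name: 2.0 * sum(r * x for r, x in zip(residual, vector))
            for name, vector in palette.items()
        }
        for name in weights:
            # Clip at zero because a negative amount of an odorant is not physical.
            weights[name] = max(0.0, weights[name] - rate * gradient[name])
    return weights


# Each vector holds hypothetical affinities for three receptors.
PALETTE = {
    "odorant_A": [0.9, 0.1, 0.0],
    "odorant_B": [0.1, 0.8, 0.2],
    "odorant_C": [0.0, 0.2, 0.9],
}
# Sensed fingerprint, constructed here as 0.5*A + 0.2*B + 0.3*C.
SENSED = [0.47, 0.27, 0.31]

weights = fit_mixture(PALETTE, SENSED)
print({name: round(w, 3) for name, w in weights.items()})
```

Because the sensed fingerprint in this example was constructed from the palette itself, the recovered weights should be close to the proportions used to build it; a real sensed fingerprint would generally leave a nonzero residual difference.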
WO 00/70343 discloses biosensors and sense replication systems using
G-protein coupled receptors (GPCR). It was proposed to mimic the response of the
G-protein signal transduction system by detecting the affinity of a stimulus to a plurality
of GPCR, codifying such information into electronic signals, and reproducing the
stimulus by converting the codified information into a combination of stimulant
entities. But this reference also does not teach or suggest the human olfactory
receptors disclosed herein.

WO 01/27158 discloses olfactory receptors and their use to determine the 
correspondence between individual odorant receptors and particular odors. It was 
proposed that the interactions between an odor and olfactory receptors can be used to 
represent the odor and to re-create it. This reference, however, does not teach or
suggest the human olfactory receptors disclosed herein. 

WO 01/98526 PCT/US01/20122

Dissecting the function of sensory receptors by binding sensory receptors of a
clone of cells expressing a single sensory receptor gene, fragmentation of sensory
receptors to provide ligand-binding or signal-transducing domains thereof, and
construction of fusion sensory receptor proteins will separate the diverse
physiological functions associated with sensory perception at the level of
ligand-receptor binding. Furthermore, novel "primary sensants" that are identified and
isolated by the methods described herein may be used to further define the function of
the cognate sensory receptor by uniquely enhancing or, in the alternative, blocking
stimulation of sensory receptors with a single ligand-binding domain.

The present invention addresses the need for better understanding of these 
ligand-receptor interactions by using a large set of identified sensory receptors. Even 
if the receptor set used is incomplete (i.e., a partial set of all sensory receptors encoded
in the genome), a large number of sensants will be detected. Moreover, redundancy in 
the chemical structures recognized by the sensory receptors or combinatorial 
processing of signals from different sensory receptors would allow broad coverage of 
chemically diverse sensants (e.g., by selection of a representative class of sensory 
receptors). Also provided are, inter alia, methods for utilizing such sensory receptors
and biosensors to simulate sensory perception. To analyze ligand-receptor 
interactions and their effects on cell signaling and the processing of those signals in 
sensory perception, specific sensants and their cognate receptor complexes are 
detected, identified, and measured under binding conditions. Fragrances and 
flavorings can be detected, identified, measured, and/or custom designed by the
methods herein described. In addition, drugs that incorporate artificial odors and/or
tastes can be formulated. 

Summary of the Invention 

Large gene families encoding mammalian olfactory G-protein-coupled
receptors (OLFR) and mammalian taste G-protein-coupled receptors (TASR) are
known in the prior art or are disclosed herein. It is an object of the invention to provide
fragments and variants of such OLFRs and TASRs which retain odorant- or
tastant-binding activity, respectively. The large number of sensory receptors that are made
available herein and now amenable to manipulation raises the confidence that a
substantially complete, or at least functional, repertoire of sensory receptors is
provided.

It is an object of the invention to provide nucleic acid sequences or molecules
that encode such sensory receptors, or fragments or variants thereof. Another object is
to provide expression vectors which include nucleic acid sequences that encode such
sensory receptors, or fragments or variants thereof. It is yet another object of the
invention to provide human or non-human cells which functionally express at least
one of such sensory receptors, or fragments or variants thereof. Still another object is
to provide sensory receptor fusion proteins or other polypeptides which include at
least a fragment of at least one of such sensory receptors. In particular, fusions with
reporter molecules or other heterologous amino acid sequences may maintain the
original ligand-binding properties while changing the signaling properties to allow easier
detection of sensant binding to the receptor (e.g., change in fluorescent signal).
Moreover, chimeric proteins or other polypeptides with altered ligand-binding and/or
signaling properties can be made from two or more different sensory receptors by
mixing domains.

The invention provides methods for representing the sensory perception of one
or more chemicals (e.g., a primary sensant or mixture thereof) and/or for predicting
the sensory perception of one or more chemicals in a mammal (e.g., human) using the
aforementioned products. Given a known member of a ligand-receptor binding pair,
one or both members of the pair (i.e., ligand, receptor, or both) may be detected,
identified, and/or measured under binding conditions.

Novel molecules or combinations of molecules which elicit a desired and
predetermined sensory perception in a mammal (e.g., human) can be generated by
determining a value of sensory perception in a mammal for a known molecule or
combination thereof; determining a value of sensory perception in a mammal for one
or more unknown molecules or combinations thereof; comparing the value of sensory
perception in a mammal for one or more unknown compositions to the value of
sensory perception in a mammal for one or more known compositions; selecting a
molecule or combination of molecules that elicits a predetermined sensory perception
in a mammal; and mixing two or more unknown molecules or combinations thereof to
form a molecule or combination thereof that elicits a predetermined sensory
perception in a mammal. The combining step yields a single molecule or a
combination thereof that elicits a predetermined sensory perception in a mammal. In
particular, primary odorants that uniquely bind to olfactory receptors with a single
ligand-binding domain may be such novel molecules. Variants thereof may be used
(1) to enhance stimulation of a limited response by olfactory receptors with a single
ligand-binding domain or (2) to block stimulation of olfactory receptors with a single
ligand-binding domain to reduce or inhibit olfactory perception. An alternative
method for identifying primary odorants is to identify the mutated receptors in genetic
anosmias because that odorant would be expected to be recognized by only one or a
few olfactory receptors affected by the mutation.

Detailed Description of the Invention 

Perception of Chemical Sensants

U.S. Patent No. 5,691,188 describes how, upon binding of ligand to receptor,
the receptor presumably undergoes a conformational change leading to activation of
the G protein. The G proteins are comprised of three subunits: a guanyl
nucleotide-binding α subunit, a β subunit, and a γ subunit. G proteins cycle between two forms,
depending on whether GDP or GTP is bound to the α subunit. When GDP is bound,
the G protein exists as a heterotrimer: the Gαβγ complex. When GTP is bound, the α
subunit dissociates from the heterotrimer, leaving a Gβγ complex. When a Gαβγ
complex operatively associates with an activated G protein-coupled receptor in a cell
membrane, the rate of exchange of GTP for bound GDP is increased and the rate of
dissociation of the bound Gα subunit from the Gαβγ complex increases. The free
Gα subunit and Gβγ complex are thus capable of transmitting a signal to downstream
elements of a variety of signal transduction pathways. These events form the basis for
a multiplicity of different cell signaling phenomena, including, for example, the
signaling phenomena that are identified as neurological sensory perceptions such as
taste and/or smell. A high-resolution X-ray crystal structure of rhodopsin, a G
protein-coupled receptor, has been solved (Palczewski et al., Science 289:739, 2000).
Using this structure, the portions of the amino acid sequence
of sensant receptors that are responsible for ligand binding can be identified.

Sensory Perception - Olfaction

Genes encoding the olfactory receptors are active primarily in olfactory
neurons (Axel, Sci. Amer. 273:154, 1995). Individual olfactory receptor types are
expressed in subsets of cells distributed in distinct zones of the olfactory epithelium
(Breer, Semin. Cell Biol. 5:25, 1994). The human genome contains thousands of genes
that encode a diverse repertoire of olfactory receptors (Rouquier, Nat. Genet. 18:243,
1998; Trask, Hum. Mol. Genet. 7:2007, 1998). An understanding of an animal's
ability to detect and discriminate among the thousands of distinct odorants or tastants,
and more particularly to distinguish, for example, beneficial tastants or odorants from
toxic tastants or odorants, is complicated by the fact that sensory receptors belong to a
multigene family with over a thousand members, and the odorant receptors number at
least 500 to 1,000. Moreover, each sensory receptor neuron may express only one or a
few of these receptors. With respect to odorant receptors, any given olfactory neuron
can respond to a small set of odorant ligands. In addition, odorant discrimination for a
given neuron may depend on the ligand specificity of the one or few receptors it
expresses.

Sensory Perception - Taste 

Mammals are believed to have five basic taste modalities: sweet, bitter, sour,
salty, and umami (the taste of monosodium glutamate). See, e.g., Kawamura et al.,
Introduction to Umami: A Basic Taste (1987); Kinnamon et al., Ann. Rev. Physiol.
54:715, 1992; Lindemann, Physiol. Rev. 76:718, 1996; Stewart et al., Am. J. Physiol.
272:1, 1997. Numerous physiological studies in animals have shown that taste
receptor cells may selectively respond to different chemical stimuli. See, e.g., Akabas
et al., Science 242:1047, 1988; Gilbertson et al., J. Gen. Physiol. 100:803, 1992;
Bernhardt et al., J. Physiol. 490:325, 1996; Cummings et al., J. Neurophysiol.
75:1256, 1996.

In mammals, taste receptor cells are assembled into taste buds that are
distributed into different papillae in the tongue epithelium. Circumvallate papillae,
found at the very back of the tongue, contain hundreds (e.g., mice) to thousands (e.g.,
human) of taste buds. By contrast, foliate papillae, localized to the posterior lateral
edge of the tongue, only contain dozens to hundreds of taste buds. Moreover,
fungiform papillae contain only a single or a few taste buds, and are at the front of the
tongue.

AC010814, AC018700, AC021304, AC008620, AC011537, AC010760, AC027641,
AC017103, AC024729, AC024257, AC025115, AP001524, AP000916, AC010814,
AL162254, AC025234, AP001521, AC026090, AC019088, AC016856, AC016787,
AC009594, AC026038, AQ628489, AC025942, AL163152, AC026975, AC024654,
AP001803, AP001804, AL353767, AP001884, AC026083, AC018793, AP000818,
AL353894, AL049734, AL355366, AC011464, AC037472, AC036111, AC019093,
AC027239, AC027522, AC009545, AC021333, AC036216, AC021935, AC022762,
AL356019, AC055861, AC018375, AC072059, AC068339, AC022891, AL357039,
AP002345, AC044810, AC073113, AC024399, AC023564, AL390860, AC074365,
AP002826, AL359636, AL391534, AC055731, AC076959, AP002826, AC019088,
AC009779, AL445307, AP002512, AP000818, AC079190) by virtue of their
sequence homology to some of the known human and other mammalian olfactory
receptor genes. Similarly, genes encoding five and thirty-six (36) distinct, novel
human T1R and T2R taste receptors, respectively, have been identified in genome
sequence databases.

Alternatively, nucleic acids encoding the sensory receptors and other related 
polypeptides can be isolated from a variety of sources, genetically engineered,
amplified, synthesized, and/or expressed recombinantly according to the methods 
disclosed in WO 00/35374, which is herein incorporated by reference in its entirety. 

These nucleic acids provide probes for the identification of cells expressing
sensory receptors, as the nucleic acids are specifically expressed in such cells. They
can also serve as tools for the generation of sensory topographical maps that elucidate
the relationship between cells expressing sensory receptors and sensory neurons
leading to particular regions of the brain. Furthermore, the nucleic acids and the
polypeptides they encode can be used as probes to elucidate olfactant- or
tastant-induced behaviors.

Nucleic acid molecules encoding a sensory receptor comprising a nucleic acid
sequence that is at least 75%, 85%, 90%, 95%, or 99% identical to a nucleic acid
sequence selected from those known in the prior art or disclosed herein are considered
variants. Other nucleic acid molecules comprising a nucleic acid sequence that
encodes a polypeptide having an amino acid sequence at least 75%, 85%, 90%, 95%,
or 99% identical to an amino acid sequence selected from those known in the prior art
or disclosed herein are also considered variants. Further variants contain amino acid
sequence differences in at most ten, five, four, three, two, or one amino acid
residue(s).
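
By way of illustration only, the percent-identity thresholds and residue-difference limits recited above may be checked as sketched below. The two sequences are hypothetical and are not any SEQ ID NO disclosed herein. The sketch assumes the sequences have already been aligned (with "-" marking a gap) and assumes that identity is computed over all aligned columns; other denominator conventions exist and would give different values.

```python
# Illustrative sketch only: the sequences are hypothetical, and computing
# identity over all aligned columns is one convention among several.

def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns holding the same residue ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")  # ignore columns that are gaps in both
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def count_differences(aligned_a, aligned_b):
    """Number of aligned columns at which the two sequences differ."""
    return sum(1 for a, b in zip(aligned_a, aligned_b) if a != b)


def highest_threshold_met(identity, thresholds=(75, 85, 90, 95, 99)):
    """Largest recited threshold that the identity value reaches, or None."""
    met = [t for t in thresholds if identity >= t]
    return max(met) if met else None


reference = "MDGGNQSEGSEFLLLGMSES"  # hypothetical 20-residue reference
candidate = "MDGGNQSEGSEFLLLGMSEA"  # one substitution at the last position

identity = percent_identity(reference, candidate)
print(identity, highest_threshold_met(identity), count_differences(reference, candidate))
```

In this example one of twenty aligned positions differs, so the candidate would meet the 95% threshold but not the 99% threshold, and would also fall within the "at most one amino acid residue" limit.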

Exemplary nucleic acid sequences may be selected from the group consisting
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12,
SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24,
SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36,
SEQ ID NO: 38, SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 48,
SEQ ID NO: 50, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, SEQ ID NO: 58, SEQ ID NO: 60,
SEQ ID NO: 62, SEQ ID NO: 64, SEQ ID NO: 66, SEQ ID NO: 68, SEQ ID NO: 70, SEQ ID NO: 72,
SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84,
SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 90, SEQ ID NO: 92, SEQ ID NO: 94, SEQ ID NO: 96,
SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, SEQ ID NO: 104, SEQ ID NO: 106, SEQ ID NO: 108,
SEQ ID NO: 110, SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 116, SEQ ID NO: 118, SEQ ID NO: 120,
SEQ ID NO: 122, SEQ ID NO: 124, SEQ ID NO: 126, SEQ ID NO: 128, SEQ ID NO: 130, SEQ ID NO: 132,
SEQ ID NO: 134, SEQ ID NO: 136, SEQ ID NO: 138, SEQ ID NO: 140, SEQ ID NO: 142, SEQ ID NO: 144,
SEQ ID NO: 146, SEQ ID NO: 148, SEQ ID NO: 150, SEQ ID NO: 152, SEQ ID NO: 154, SEQ ID NO: 156,
SEQ ID NO: 158, SEQ ID NO: 160, SEQ ID NO: 162, SEQ ID NO: 164, SEQ ID NO: 166, SEQ ID NO: 168,
SEQ ID NO: 170, SEQ ID NO: 172, SEQ ID NO: 174, SEQ ID NO: 176, SEQ ID NO: 178, SEQ ID NO: 180,
SEQ ID NO: 182, SEQ ID NO: 184, SEQ ID NO: 186, SEQ ID NO: 188, SEQ ID NO: 190, SEQ ID NO: 192,
SEQ ID NO: 194, SEQ ID NO: 196, SEQ ID NO: 198, SEQ ID NO: 200, SEQ ID NO: 202, SEQ ID NO: 204,
SEQ ID NO: 206, SEQ ID NO: 208, SEQ ID NO: 210, SEQ ID NO: 212, SEQ ID NO: 214, SEQ ID NO: 216,
SEQ ID NO: 218, SEQ ID NO: 220, SEQ ID NO: 222, SEQ ID NO: 224, SEQ ID NO: 226, SEQ ID NO: 228,
SEQ ID NO: 230, SEQ ID NO: 232, SEQ ID NO: 234, SEQ ID NO: 236, SEQ ID NO: 238, SEQ ID NO: 240,
SEQ ID NO: 242, SEQ ID NO: 244, SEQ ID NO: 246, SEQ ID NO: 248, SEQ ID NO: 250, SEQ ID NO: 252,
SEQ ID NO: 254, SEQ ID NO: 256, SEQ ID NO: 258, SEQ ID NO: 260, SEQ ID NO: 262, SEQ ID NO: 264,
SEQ ID NO: 266, SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274, SEQ ID NO: 276,
SEQ ID NO: 278, SEQ ID NO: 280, SEQ ID NO: 282, SEQ ID NO: 284, SEQ ID NO: 286, SEQ ID NO: 288,
SEQ ID NO: 290, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 296, SEQ ID NO: 298, SEQ ID NO: 300,
SEQ ID NO: 302, SEQ ID NO: 304, SEQ ID NO: 306, SEQ ID NO: 308, SEQ ID NO: 310, SEQ ID NO: 312,
SEQ ID NO: 314, SEQ ID NO: 316, SEQ ID NO: 318, SEQ ID NO: 320, SEQ ID NO: 322, SEQ ID NO: 324,
SEQ ID NO: 326, SEQ ID NO: 328, SEQ ID NO: 330, SEQ ID NO: 332, SEQ ID NO: 334, SEQ ID NO: 336,
SEQ ID NO: 338, SEQ ID NO: 340, SEQ ID NO: 342, SEQ ID NO: 344, SEQ ID NO: 346, SEQ ID NO: 348,
SEQ ID NO: 350, SEQ ID NO: 352, SEQ ID NO: 354, SEQ ID NO: 356, SEQ ID NO: 358, SEQ ID NO: 360,
SEQ ID NO: 362, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 370, SEQ ID NO: 372,
SEQ ID NO: 374, SEQ ID NO: 376, SEQ ID NO: 378, SEQ ID NO: 380, SEQ ID NO: 382, SEQ ID NO: 384,
SEQ ID NO: 386, SEQ ID NO: 388, SEQ ID NO: 390, SEQ ID NO: 392, SEQ ID NO: 394, SEQ ID NO: 396,
SEQ ID NO: 398, SEQ ID NO: 400, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 406, SEQ ID NO: 408,
SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 414, SEQ ID NO: 416, SEQ ID NO: 418, SEQ ID NO: 420,
SEQ ID NO: 422, SEQ ID NO: 424, SEQ ID NO: 426, SEQ ID NO: 428, SEQ ID NO: 430, SEQ ID NO: 432,
SEQ ID NO: 434, SEQ ID NO: 436, SEQ ID NO: 438, SEQ ID NO: 440, SEQ ID NO: 442, SEQ ID NO: 444,
SEQ ID NO: 446, SEQ ID NO: 448, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 454, SEQ ID NO: 456,
SEQ ID NO: 458, SEQ ID NO: 460, SEQ ID NO: 462, SEQ ID NO: 464, SEQ ID NO: 466, SEQ ID NO: 468,
SEQ ID NO: 470, SEQ ID NO: 472, SEQ ID NO: 474, SEQ ID NO: 476, SEQ ID NO: 478, SEQ ID NO: 480,
SEQ ID NO: 482, SEQ ID NO: 484, SEQ ID NO: 486, SEQ ID NO: 488, SEQ ID NO: 490, SEQ ID NO: 492,
SEQ ID NO: 494, SEQ ID NO: 496, SEQ ID NO: 498, SEQ ID NO: 500, SEQ ID NO: 502, SEQ ID NO: 504,
SEQ ID NO: 506, SEQ ID NO: 508, SEQ ID NO: 510, and SEQ ID NO: 512.

Nucleic acid molecules comprising a nucleic acid sequence that encodes a
fragment of a polypeptide having an amino acid sequence selected from those known
in the prior art or disclosed herein, wherein the fragment is at least ten, 20, 30, 50, 70,
100, or 150 amino acid residues in length, are useful as probes, primers, and to
construct hybrids or chimerae.

Polypeptides comprising an amino acid sequence that is at least 90%, 95%,
96%, 97%, 98%, or 99% identical to an amino acid sequence selected from those
known in the prior art or disclosed herein are considered variants. Further variants
contain amino acid sequence differences in at most ten, five, four, three, two, or one
amino acid residue(s). Other polypeptides comprising a fragment of a polypeptide
having an amino acid sequence selected from those known in the prior art or disclosed
herein, wherein the fragment is at least 40, 60, 80, 100, 150, 200, or 250 amino acid
residues in length, are useful as specific binders of sensants, competitive binders,
antigens, and to construct hybrids or chimerae.

Exemplary amino acid sequences may be selected from the group consisting of
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11,
SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23,
SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 35,
SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 43, SEQ ID NO: 45, SEQ ID NO: 47,
SEQ ID NO: 49, SEQ ID NO: 51, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 57, SEQ ID NO: 59,
SEQ ID NO: 61, SEQ ID NO: 63, SEQ ID NO: 65, SEQ ID NO: 67, SEQ ID NO: 69, SEQ ID NO: 71,
SEQ ID NO: 73, SEQ ID NO: 75, SEQ ID NO: 77, SEQ ID NO: 79, SEQ ID NO: 81, SEQ ID NO: 83,
SEQ ID NO: 85, SEQ ID NO: 87, SEQ ID NO: 89, SEQ ID NO: 91, SEQ ID NO: 93, SEQ ID NO: 95,
SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, SEQ ID NO: 107,
SEQ ID NO: 109, SEQ ID NO: 111, SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 117, SEQ ID NO: 119,
SEQ ID NO: 121, SEQ ID NO: 123, SEQ ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131,
SEQ ID NO: 133, SEQ ID NO: 135, SEQ ID NO: 137, SEQ ID NO: 139, SEQ ID NO: 141, SEQ ID NO: 143,
SEQ ID NO: 145, SEQ ID NO: 147, SEQ ID NO: 149, SEQ ID NO: 151, SEQ ID NO: 153, SEQ ID NO: 155,
SEQ ID NO: 157, SEQ ID NO: 159, SEQ ID NO: 161, SEQ ID NO: 163, SEQ ID NO: 165, SEQ ID NO: 167,
SEQ ID NO: 169, SEQ ID NO: 171, SEQ ID NO: 173, SEQ ID NO: 175, SEQ ID NO: 177, SEQ ID NO: 179,
SEQ ID NO: 181, SEQ ID NO: 183, SEQ ID NO: 185, SEQ ID NO: 187, SEQ ID NO: 189, SEQ ID NO: 191,
SEQ ID NO: 193, SEQ ID NO: 195, SEQ ID NO: 197, SEQ ID NO: 199, SEQ ID NO: 201, SEQ ID NO: 203,
SEQ ID NO: 205, SEQ ID NO: 207, SEQ ID NO: 209, SEQ ID NO: 211, SEQ ID NO: 213, SEQ ID NO: 215,
SEQ ID NO: 217, SEQ ID NO: 219, SEQ ID NO: 221, SEQ ID NO: 223, SEQ ID NO: 225, SEQ ID NO: 227,
SEQ ID NO: 229, SEQ ID NO: 231, SEQ ID NO: 233, SEQ ID NO: 235, SEQ ID NO: 237, SEQ ID NO: 239,
SEQ ID NO: 241, SEQ ID NO: 243, SEQ ID NO: 245, SEQ ID NO: 247, SEQ ID NO: 249, SEQ ID NO: 251,
SEQ ID NO: 253, SEQ ID NO: 255, SEQ ID NO: 257, SEQ ID NO: 259, SEQ ID NO: 261, SEQ ID NO: 263,
SEQ ID NO: 265, SEQ ID NO: 267, SEQ ID NO: 269, SEQ ID NO: 271, SEQ ID NO: 273, SEQ ID NO: 275,
SEQ ID NO: 277, SEQ ID NO: 279, SEQ ID NO: 281, SEQ ID NO: 283, SEQ ID NO: 285, SEQ ID NO: 287,
SEQ ID NO: 289, SEQ ID NO: 291, SEQ ID NO: 293, SEQ ID NO: 295, SEQ ID NO: 297, SEQ ID NO: 299,
SEQ ID NO: 301, SEQ ID NO: 303, SEQ ID NO: 305, SEQ ID NO: 307, SEQ ID NO: 309, SEQ ID NO: 311,
SEQ ID NO: 313, SEQ ID NO: 315, SEQ ID NO: 317, SEQ ID NO: 319, SEQ ID NO: 321, SEQ ID NO: 323,
SEQ ID NO: 325, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 333, SEQ ID NO: 335,
SEQ ID NO: 337, SEQ ID NO: 339, SEQ ID NO: 341, SEQ ID NO: 343, SEQ ID NO: 345, SEQ ID NO: 347,
SEQ ID NO: 349, SEQ ID NO: 351, SEQ ID NO: 353, SEQ ID NO: 355, SEQ ID NO: 357, SEQ ID NO: 359,
SEQ ID NO: 361, SEQ ID NO: 363, SEQ ID NO: 365, SEQ ID NO: 367, SEQ ID NO: 369, SEQ ID NO: 371,
SEQ ID NO: 373, SEQ ID NO: 375, SEQ ID NO: 377, SEQ ID NO: 379, SEQ ID NO: 381, SEQ ID NO: 383,
SEQ ID NO: 385, SEQ ID NO: 387, SEQ ID NO: 389, SEQ ID NO: 391, SEQ ID NO: 393, SEQ ID NO: 395,
SEQ ID NO: 397, SEQ ID NO: 399, SEQ ID NO: 401, SEQ ID NO: 403, SEQ ID NO: 405, SEQ ID NO: 407,
SEQ ID NO: 409, SEQ ID NO: 411, SEQ ID NO: 413, SEQ ID NO: 415, SEQ ID NO: 417, SEQ ID NO: 419,
SEQ ID NO: 421, SEQ ID NO: 423, SEQ ID NO: 425, SEQ ID NO: 427, SEQ ID NO: 429, SEQ ID NO: 431,
SEQ ID NO: 433, SEQ ID NO: 435, SEQ ID NO: 437, SEQ ID NO: 439, SEQ ID NO: 441, SEQ ID NO: 443,
SEQ ID NO: 445, SEQ ID NO: 447, SEQ ID NO: 449, SEQ ID NO: 451, SEQ ID NO: 453, SEQ ID NO: 455,
SEQ ID NO: 457, SEQ ID NO: 459, SEQ ID NO: 461, SEQ ID NO: 463, SEQ ID NO: 465, SEQ ID NO: 467,
SEQ ID NO: 469, SEQ ID NO: 471, SEQ ID NO: 473, SEQ ID NO: 475, SEQ ID NO: 477, SEQ ID NO: 479,
SEQ ID NO: 481, SEQ ID NO: 483, SEQ ID NO: 485, SEQ ID NO: 487, SEQ ID NO: 489, SEQ ID NO: 491,
SEQ ID NO: 493, SEQ ID NO: 495, SEQ ID NO: 497, SEQ ID NO: 499, SEQ ID NO: 501, SEQ ID NO: 503,
SEQ ID NO: 505, SEQ ID NO: 507, SEQ ID NO: 509 and SEQ ID NO: 511.

Also provided are methods of screening for modulators, e.g., activators,
inhibitors, stimulators, enhancers, agonists, and antagonists, of the sensory receptors,
or fragments or variants thereof. Such modulators of signal transduction are useful for
pharmacological or genetic modulation of signaling pathways. These methods of
screening can be used to identify high affinity agonists and antagonists of sensory cell
activity. These modulator compounds can then be used in the pharmaceutical, food,
and cosmetic industries to customize odorants or tastants.

Thus, the invention provides assays for sensory modulation, where the sensory
receptors, or fragments or variants thereof, act as direct or indirect reporter molecules
for the effect of modulators on signal transduction. Sensory receptors, or fragments or
variants thereof, can be used in assays, e.g., to measure changes in ion concentration,
membrane potential, current flow, ion flux, transcription, signal transduction,
receptor-ligand interaction, second messenger concentrations, in vitro, in vivo, and ex
vivo. In one embodiment, sensory receptors, or fragments or variants thereof, can be
used as indirect reporters via attachment to second reporter molecules, such as
green fluorescent protein (see, e.g., Misteli et al., Nature Biotech. 15:961, 1997). In
another embodiment, the sensory receptors, or fragments or variants thereof, can be
expressed in host cells, and modulation of signal transduction via sensory receptor
activity can be assayed by measuring changes in Ca2+ levels.

Methods of assaying for modulators of signal transduction include in vitro
ligand binding assays using the sensory receptors, or fragments or variants thereof.
More particularly, such assays can use the sensory receptors; portions thereof such as
the extracellular or transmembrane domains; chimeric proteins comprising one or
more of such domains; oocyte receptor expression; tissue culture cell receptor
expression; transcriptional activation of the receptor; G-protein binding to the
receptor; ligand binding assays; voltage, membrane potential and conductance
changes; ion flux assays; changes in intracellular second messengers such as cAMP
and inositol triphosphate; changes in intracellular Ca2+ levels; and neurotransmitter
release.

The invention also provides for methods of detecting sensory receptor nucleic
acid and protein expression, allowing for the investigation of taste transduction
regulation and specific identification of sensory receptor cells. The sensory receptors,
fragments, and variants of the invention can also be used to generate monoclonal and
polyclonal antibodies useful for identifying sensory receptor cells. Sensory receptor
cells can be identified using techniques such as reverse transcription and amplification
of mRNA, isolation of total RNA or poly A+ RNA, Northern blotting, dot blotting, in
situ hybridization, RNase protection, S1 digestion, probing DNA microchip arrays,
Western blots, and the like.

A. Identification and Characterization of Sensory Receptors 
The amino acid sequences of the sensory receptors and polypeptides of the
invention can be identified by putative translation of the coding nucleic acid
sequences. These various amino acid sequences and the coding nucleic acid 
sequences may be compared to one another or to other sequences according to a 
number of methods. 

For example, in sequence comparison, typically one sequence acts as a
reference sequence, to which test sequences are compared. When using a sequence
comparison algorithm, test and reference sequences are entered into a computer,
subsequence coordinates are designated, if necessary, and sequence algorithm
program parameters are designated. Default program parameters can be used, as
described below for the BLASTN and BLASTP programs, or alternative parameters
can be designated. The sequence comparison algorithm then calculates the percent
sequence identities for the test sequences relative to the reference sequence, based on
the program parameters.

A "comparison window," as used herein, includes reference to a segment of
any one of the number of contiguous positions selected from the group consisting of
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150, in
which a sequence may be compared to a reference sequence of the same number of
contiguous positions after the two sequences are optimally aligned. Methods of
alignment of sequences for comparison are well-known in the art. Optimal alignment
of sequences for comparison can be conducted, e.g., by the local homology algorithm
of Smith & Waterman, Adv. Appl. Math. 2:482, 1981, by the homology alignment
algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443, 1970, by the search for
similarity method of Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444, 1988, by
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group,
575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see,
e.g., Current Protocols in Molecular Biology (Ausubel et al., eds. 1995 supplement)).
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A preferred example of an algorithm that is suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et al, NucL Acids Res. 25:3389,.1977 and Altschul et 
al, JMol Biol. 215:403, 1990, respectively. Software for performing BLAST 
5 analyses is publicly available through the National Center for Biotechnology 

Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying 
high scoring sequence pairs (HSPs) by identifying short words of length W in the 
query sequence, which either match or satisfy some positive- valued threshold score T 
when aligned with a word of the same length in a database sequence. T is referred to 

10 as the neighborhood word score threshold (Altschul et al, Altschul et al., NucL Acids 
Res. 25:3389, 1977 and Altschul et al., J. Mol Biol 215:403, 1990). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs 
containing them. The word hits are extended in both directions along each sequence 
for as far as the cumulative alignment score can be increased. Cumulative scores are 

15 calculated using, for nucleotide sequences, the parameters M (reward score for a pair 
of matching residues; always > 0) and N (penalty score for mismatching residues; 
always < 0). For amino acid sequences, a scoring matrix is used to calculate the 
cumulative score. Extension of the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved 

20 value; the cumulative score goes to zero or below, due to the accumulation of one or 
more negative-scoring residue alignments; or the end of either sequence is reached. 
The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of 
the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a 
-wordlength (W) of 1 1, an expectation (E) or 10, M=5, N=-4 and a comparison of both 

25 strands. For amino acid sequences, the BLASTP program uses as defaults a 

wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix {see 
Henikoff & HenikofT, Proc. Natl. Acad ScL USA 89:10915, 1989) alignments (B) of 
50, expectation (E) of 10, M=5, N— 4, and a comparison of both strands. 
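
The extension step described above can be sketched for nucleotide sequences as follows. This is a simplified, ungapped, one-direction illustration and not the BLAST implementation itself: the function name and arguments are hypothetical, the reward and penalty default to the M=5 and N=-4 values given in the text, and the X value of 20 is an arbitrary assumption, since no default for X is stated here.

```python
def extend_hit(query, subject, q_pos, s_pos, seed_score,
               reward=5, penalty=-4, x_drop=20):
    """Extend a seed hit to the right without gaps.

    q_pos and s_pos are the first positions after the seed in each sequence.
    Returns (best_score, best_length), where best_length is the number of
    extended positions at which the maximum cumulative score was reached.
    """
    score = best = seed_score
    best_len = 0
    i = 0
    # Stop when the end of either sequence is reached.
    while q_pos + i < len(query) and s_pos + i < len(subject):
        score += reward if query[q_pos + i] == subject[s_pos + i] else penalty
        i += 1
        if score > best:
            best, best_len = score, i
        # Stop when the score falls off by X from its maximum,
        # or when the cumulative score goes to zero or below.
        if best - score >= x_drop or score <= 0:
            break
    return best, best_len
```

A full search would apply the same rule in both directions from each seed word and, for amino acid sequences, would look up each residue pair in a scoring matrix such as BLOSUM62 instead of using a fixed reward and penalty.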

Another example of a useful algorithm is PILEUP. PILEUP creates a multiple
sequence alignment from a group of related sequences using progressive, pairwise
alignments to show relationship and percent sequence identity. It also plots a
so-called "tree" or "dendrogram" showing the clustering relationships used to create the
alignment (see, e.g., Figure 2). PILEUP uses a simplification of the progressive
alignment method of Feng & Doolittle, J. Mol. Evol. 35:351, 1987. The method used
is similar to the method described by Higgins & Sharp, CABIOS 5:151, 1989. The
program can align up to 300 sequences, each of a maximum length of 5,000
nucleotides or amino acids. The multiple alignment procedure begins with the
pairwise alignment of the two most similar sequences, producing a cluster of two
aligned sequences. This cluster is then aligned to the next most related sequence or
cluster of aligned sequences. Two clusters of sequences are aligned by a simple
extension of the pairwise alignment of two individual sequences. The final alignment
is achieved by a series of progressive, pairwise alignments. The program is run by
designating specific sequences and their amino acid or nucleotide coordinates for
regions of sequence comparison and by designating the program parameters. Using
PILEUP, a reference sequence is compared to other test sequences to determine the
percent sequence identity relationship using the following parameters: default gap
weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can
be obtained from the GCG sequence analysis software package, e.g., version 7.0
(Devereux et al., Nucl. Acids Res. 12:387, 1984).

The protein sequences encoded by the genes were derived
by conceptual translation of the corresponding open reading frames. Comparison of
these protein sequences to all known proteins in the public sequence databases using
the BLASTP algorithm revealed their strong homology to the members of the mammalian
olfactory receptor family, each of the olfactory receptor sequences having at least
50%, and preferably at least 55%, at least 60%, at least 65%, and most preferably at
least 70%, amino acid identity to at least one known member of the family.

The nucleic acid molecules of the present invention are typically intronless and
encode putative sensory receptor proteins generally having lengths of about 300 to
about 400 amino acid residues that contain seven transmembrane domains, as
predicted by hydrophobicity plotting analysis, indicating that they belong to the G
protein-coupled receptor superfamily, which includes the subset of taste and olfactory
receptors. In addition to the overall structural similarity, each of the 256 sensory
receptors identified herein has a characteristic sequence signature of an olfactory
receptor. In particular, all 256 sequences contain very close matches to the following
consensus amino acid motifs (Mombaerts, 1999; Pilpel, 1999): LHTPMY in
intracellular loop 1, MAYDRYVAIC at the end of transmembrane domain 3 and the
beginning of intracellular loop 2, SY at the end of transmembrane domain 5,
FSTCSSH in the beginning of transmembrane domain 6, and PMLNPF in
transmembrane domain 7. Combination of all the above mentioned structural features
of the 256 genes and encoded proteins strongly suggests that they represent novel
members of the human olfactory receptor family.
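
As an illustration of what a "very close match" to these consensus motifs might look like computationally, the sketch below scans a protein sequence for the window with the fewest mismatches to each motif. It is only a sketch under stated assumptions: the function names are hypothetical, mismatches are counted by simple position-by-position comparison, and the two-residue SY motif is left out because on its own it is too short to be informative without positional context.

```python
# Consensus motifs quoted in the text (the two-residue SY motif is omitted).
MOTIFS = ["LHTPMY", "MAYDRYVAIC", "FSTCSSH", "PMLNPF"]


def best_match(sequence, motif):
    """Return (start, mismatches) for the window closest to the motif.

    Returns None if the sequence is shorter than the motif. Ties are
    resolved in favor of the earliest window.
    """
    best = None
    for start in range(len(sequence) - len(motif) + 1):
        window = sequence[start:start + len(motif)]
        mismatches = sum(1 for a, b in zip(window, motif) if a != b)
        if best is None or mismatches < best[1]:
            best = (start, mismatches)
    return best


def signature_report(sequence, motifs=MOTIFS):
    """Map each motif to its best (start, mismatches) in the sequence."""
    return {motif: best_match(sequence, motif) for motif in motifs}
```

A candidate could then be flagged as carrying the signature when every motif is found within some small mismatch tolerance and the motifs occur in the expected order; the tolerance is a choice for the user and is not specified in the text.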
As noted above, complete or partial sequences of numerous human and other
eukaryotic sensory receptors are currently known. The novel human receptors have
amino acid sequences distinctly different from the previously known human sensory
receptors, which suggests their different specificity in sensant recognition. Therefore,
these novel sensory receptors and their genes can be used, alone or in combination
with known sensory receptors, in developing detection systems and assays for
chemically distinct types of sensants not recognized by the known sensory receptors,
as well as for diagnostic and research purposes.

B. Definitions 

The terms "purified," "substantially purified," and "isolated" as used herein
refer to the state of being free of other, dissimilar compounds with which the
compound of the invention is normally associated in its natural state, so that the
"purified," "substantially purified," and "isolated" subject comprises at least 0.5%,
1%, 5%, 10%, or 20%, and most preferably at least 50% or 75% of the mass, by
weight, of a given sample. In one preferred embodiment, these terms refer to the
compound of the invention comprising at least 95% of the mass, by weight, of a given
sample. As used herein, the terms "purified," "substantially purified," and
"isolated," when referring to a nucleic acid or protein, or to nucleic acids or proteins,
also refer to a state of purification or concentration different than that which occurs
naturally in the mammalian, especially human, body. Any degree of purification or
concentration greater than that which occurs naturally in the mammalian, especially
human, body, including (1) the purification from other associated structures or
compounds or (2) the association with structures or compounds with which it is not
normally associated in the mammalian, especially human, body, is within the
meaning of "isolated." The nucleic acid or protein or classes of nucleic acids or
proteins, described herein, may be isolated, or otherwise associated with structures or
compounds with which they are not normally associated in nature, according to a variety
of methods and processes known to those of skill in the art.

The terms "amplifying" and "amplification" refer to the use of any suitable
amplification methodology for generating or detecting recombinant or naturally
expressed nucleic acid, as described in detail below. For example, the invention
provides methods and reagents (e.g., specific degenerate oligonucleotide primer pairs)
for amplifying (e.g., by polymerase chain reaction, PCR) naturally expressed (e.g.,
genomic or mRNA) or recombinant (e.g., cDNA) nucleic acids of the invention (e.g.,
sensant-binding sequences of the invention) in vivo or in vitro.

The term "7-transmembrane receptor" means a polypeptide belonging to a
superfamily of transmembrane proteins that have seven domains, each of which spans
the plasma membrane (thus, the seven domains are called "transmembrane" or "TM"
domains TM I to TM VII). The families of olfactory and taste receptors each belong
to this superfamily. Seven-transmembrane receptor polypeptides have similar and
characteristic primary, secondary and tertiary structures, as discussed in further detail
below.

The term "expression vector" refers to any recombinant expression system for
the purpose of expressing a nucleic acid sequence of the invention in vitro or in vivo,
constitutively or inducibly, in any cell, including prokaryotic, yeast, fungal, plant,
insect or mammalian cell. The term includes linear or circular expression systems.
The term includes expression systems that remain episomal or integrate into the host
cell genome. The expression systems can have the ability to self-replicate or not, i.e.,
drive only transient expression in a cell. The term includes recombinant expression
cassettes which contain only the minimum elements needed for transcription of the
recombinant nucleic acid.

The term "library" means a preparation that is a mixture of different nucleic
acid or polypeptide molecules, such as the library of recombinantly generated sensory,
particularly olfactory or taste, receptor ligand-binding domains generated by
amplification of nucleic acid with degenerate primer pairs, or an isolated collection of
vectors that incorporate the amplified sensant-binding domains, or a mixture of cells
each randomly transfected with at least one vector encoding a sensory receptor.
The term "nucleic acid" or "nucleic acid sequence" refers to a
deoxyribonucleotide or ribonucleotide oligonucleotide in either single- or double-stranded
form. The term encompasses nucleic acids, i.e., oligonucleotides, containing known
analogs of natural nucleotides. The term also encompasses nucleic-acid-like
structures with synthetic backbones, see e.g., Oligonucleotides and Analogues, a
Practical Approach, ed. F. Eckstein, Oxford Univ. Press (1991); Antisense
Strategies, Annals of the N.Y. Academy of Sciences, Vol. 600, Eds. Baserga et al.
(NYAS 1992); Milligan (1993) J. Med. Chem. 36:1923-1937; Antisense Research and
Applications (1993, CRC Press), WO 97/03211; WO 96/39154; Mata (1997) Toxicol.
Appl. Pharmacol. 144:189-197; Strauss-Soukup (1997) Biochemistry 36:8692-8698;
Samstag (1996) Antisense Nucleic Acid Drug Dev. 6:153-156.

The term sensory receptor "ligand-binding region" refers to sequences derived
from a sensory receptor that substantially incorporate transmembrane domains II to
VII (TM II to VII). The domain may be capable of binding a sensant.

The terms "conservative variant" or "analog" or "mimetic" refer to a
polypeptide which has a modified amino acid sequence, such that the change(s) do not
substantially alter the polypeptide's (the conservative variant's) structure and/or
activity, as defined herein. These include conservatively modified variations of an
amino acid sequence, i.e., amino acid substitutions, additions or deletions of those
residues that are not critical for protein activity, or substitution of amino acids with
residues having similar properties (e.g., acidic, basic, positively or negatively charged,
polar or non-polar, etc.) such that the substitutions of even critical amino acids do
not substantially alter structure and/or activity. Conservative substitution tables
providing functionally similar amino acids are well known in the art.

For example, one exemplary guideline to select conservative substitutions
includes (original residue followed by exemplary substitution): Ala/Gly or Ser;
Arg/Lys; Asn/Gln or His; Asp/Glu; Cys/Ser; Gln/Asn; Gly/Asp; Gly/Ala or Pro;
His/Asn or Gln; Ile/Leu or Val; Leu/Ile or Val; Lys/Arg or Gln or Glu; Met/Leu or Tyr
or Ile; Phe/Met or Leu or Tyr; Ser/Thr; Thr/Ser; Trp/Tyr; Tyr/Trp or Phe; Val/Ile or
Leu. An alternative exemplary guideline uses the following six groups, each
containing amino acids that are conservative substitutions for one another: 1) Alanine
(A), Serine (S), Threonine (T); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine
(N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L),
Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W)
(see also, e.g., Creighton, Proteins, W.H. Freeman, 1984; Schulz & Schirmer,
Principles of Protein Structure, Springer-Verlag, 1979). One of skill in the art will
appreciate that the above-identified substitutions are not the only possible
conservative substitutions. For example, for some purposes, one may regard all
charged amino acids as conservative substitutions for each other whether they are
positive or negative. In addition, individual substitutions, deletions or additions that
alter, add or delete a single amino acid or a small percentage of amino acids in an
encoded sequence can also be considered "conservatively modified variations."
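
The six-group guideline above lends itself to a simple lookup. The following sketch encodes the six groups with standard one-letter codes (lysine is K) and reports, for two equal-length sequences, which substituted positions are conservative under that guideline. The function names are hypothetical, and residues that appear in none of the six groups (e.g., Gly, Pro, Cys, His) are treated as conservative only when unchanged, which is an assumption of this sketch and not a statement from the text.

```python
# The six exemplary groups, written with standard one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AST"),   # 1) Alanine, Serine, Threonine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative(a, b):
    """True if a and b are identical or share one of the six groups."""
    a, b = a.upper(), b.upper()
    if a == b:
        return True
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """List (position, ref_residue, var_residue, conservative) per change.

    Positions are 1-based. Both sequences must have the same length,
    i.e. only substitutions are handled, not insertions or deletions.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    changes = []
    for pos, (r, v) in enumerate(zip(reference.upper(), variant.upper()), start=1):
        if r != v:
            changes.append((pos, r, v, is_conservative(r, v)))
    return changes
```

Whether a substitution that is conservative by this table actually preserves structure or activity remains an experimental question, as the surrounding text makes clear.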

The terms "mimetic" and "peptidomimetic" refer to a synthetic chemical
compound that has substantially the same structural and/or functional characteristics
as the polypeptides, e.g., translocation domains or sensant-binding domains or
chimeric receptors of the invention. The mimetic can be either entirely composed of
synthetic, non-natural analogs of amino acids, or can be a chimeric molecule of partly
natural peptide amino acids and partly non-natural analogs of amino acids. The
mimetic can also incorporate any amount of natural amino acid conservative
substitutions as long as such substitutions also do not substantially alter the mimetic's
structure and/or activity. As with polypeptides of the invention which are
conservative variants, routine experimentation will determine whether a mimetic is
within the scope of the invention, i.e., that its structure and/or function is not
substantially altered. Polypeptide mimetic compositions can contain any combination
of non-natural structural components, which are typically from three structural groups:
a) residue linkage groups other than the natural amide bond ("peptide bond") linkages;
b) non-natural residues in place of naturally occurring amino acid residues; or c)
residues which induce secondary structural mimicry, i.e., to induce or stabilize a
secondary structure, e.g., a beta turn, gamma turn, beta sheet, alpha helix
conformation, and the like. A polypeptide can be characterized as a mimetic when all
or some of its residues are joined by chemical means other than natural peptide bonds.
Individual peptidomimetic residues can be joined by peptide bonds, other chemical
bonds or coupling means, such as, e.g., glutaraldehyde, N-hydroxysuccinimide esters,
bifunctional maleimides, N,N'-dicyclohexylcarbodiimide (DCC) or
diisopropylcarbodiimide (DIC). Linking groups that can be an alternative to the
traditional amide bond ("peptide bond") linkages include, e.g., ketomethylene (e.g.,
-C(=O)-CH2- for -C(=O)-NH-), aminomethylene (CH2-NH), ethylene, olefin
(CH=CH), ether (CH2-O), thioether (CH2-S), tetrazole (CN4), thiazole, retroamide,
thioamide, or ester (see, e.g., Spatola (1983) in Chemistry and Biochemistry of Amino
Acids, Peptides and Proteins, Vol. 7, pp 267-357, "Peptide Backbone Modifications,"
Marcel Dekker, NY). A polypeptide can also be characterized as a mimetic by
containing all or some non-natural residues in place of naturally occurring amino acid
residues; non-natural residues are well described in the scientific and patent literature.
As used herein, "recombinant" refers to a polynucleotide synthesized or
otherwise manipulated in vitro (e.g., "recombinant polynucleotide"), to methods of
using recombinant polynucleotides to produce gene products in cells or other
biological systems, or to a polypeptide ("recombinant protein") encoded by a
recombinant polynucleotide. "Recombinant means" also encompass the ligation of
nucleic acids having various coding regions or domains or promoter sequences from
different sources into an expression cassette or vector for expression of, e.g., inducible
or constitutive expression of a fusion protein comprising a translocation domain of the
invention and a nucleic acid sequence amplified using a primer of the invention.

The term "transmembrane domain" means a polypeptide domain that can
completely span the plasma membrane. The general secondary and tertiary structure
of transmembrane domains, in particular the seven transmembrane domains of
7-transmembrane receptors such as olfactory receptors, is well known in the art. Thus,
primary structure sequence can be designed or predicted based on known
transmembrane domain sequences, as described in detail below.

C. Isolation and Expression of Olfactory Receptors 

Isolation and expression of the sensory receptors, or fragments or variants
thereof, of the invention can be performed as described below. PCR primers can be
used for the amplification of nucleic acids encoding olfactory receptor ligand-binding
regions, and libraries of these nucleic acids can thereby be generated. Libraries of
expression vectors can then be used to infect or transfect host cells for the functional
expression of these libraries. These genes and vectors can be made and expressed in
vitro or in vivo. One of skill will recognize that desired phenotypes for altering and
controlling nucleic acid expression can be obtained by modulating the expression or
activity of the genes and nucleic acids (e.g., promoters, enhancers and the like) within
the vectors of the invention. Any of the known methods described for increasing or
decreasing expression or activity can be used. The invention can be practiced in
conjunction with any method or protocol known in the art, which are well described in
the scientific and patent literature.

The nucleic acid sequences of the invention and other nucleic acids used to
practice this invention, whether RNA, cDNA, genomic DNA, vectors, viruses or
hybrids thereof, may be isolated from a variety of sources, genetically engineered,
amplified, and/or expressed recombinantly. Any recombinant expression system can
be used, including, in addition to mammalian cells, e.g., bacterial, yeast, insect or
plant systems.

Alternatively, these nucleic acids can be synthesized in vitro by well-known
chemical synthesis techniques, as described in, e.g., Carruthers, Cold Spring Harbor
Symp. Quant. Biol. 47:411-418 (1982); Adams, J. Am. Chem. Soc. 105:661 (1983);
Belousov, Nucleic Acids Res. 25:3440-3444 (1997); Frenkel, Free Radic. Biol. Med.
19:373-380 (1995); Blommers, Biochemistry 33:7886-7896 (1994); Narang, Meth.
Enzymol. 68:90 (1979); Brown, Meth. Enzymol. 68:109 (1979); Beaucage, Tetra. Lett.
22:1859 (1981); U.S. Patent No. 4,458,066. Double-stranded DNA fragments may
then be obtained either by synthesizing the complementary strand and annealing the
strands together under appropriate conditions, or by adding the complementary strand
using DNA polymerase with an appropriate primer sequence.
Techniques for the manipulation of nucleic acids, such as, for example, for
generating mutations in sequences, subcloning, labeling probes, sequencing,
hybridization and the like are well described in the scientific and patent literature.
See, e.g., Sambrook, ed., Molecular Cloning: a Laboratory Manual (2nd ed.), Vols. 1-3,
Cold Spring Harbor Laboratory (1989); Current Protocols in Molecular Biology,
Ausubel, ed. John Wiley & Sons, Inc., New York (1997); Laboratory Techniques in
Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes, Part I,
Theory and Nucleic Acid Preparation, Tijssen, ed. Elsevier, N.Y. (1993).
Nucleic acids, vectors, capsids, polypeptides, and the like can be analyzed and
quantified by any of a number of general means well known to those of skill in the art.
These include, e.g., analytical biochemical methods such as NMR, spectrophotometry,
radiography, electrophoresis, capillary electrophoresis, high performance liquid
chromatography (HPLC), thin layer chromatography (TLC), and hyperdiffusion
chromatography, various immunological methods, e.g., fluid or gel precipitin
reactions, immunodiffusion, immunoelectrophoresis, radioimmunoassay (RIA),
enzyme-linked immunosorbent assay (ELISA), immunofluorescent assay, Southern
analysis, Northern analysis, dot-blot analysis, gel electrophoresis (e.g., SDS-PAGE),
RT-PCR, quantitative PCR, other nucleic acid or target or signal amplification
methods, radiolabeling, scintillation counting, and affinity chromatography.

Oligonucleotide primers are used to amplify nucleic acid encoding an olfactory
receptor ligand-binding region. The nucleic acids described herein can also be cloned
or measured quantitatively using amplification techniques. Using exemplary
degenerate primer pair sequences (see below), the skilled artisan can select and
design suitable oligonucleotide amplification primers. Amplification methods are also
well known in the art, and include, e.g., polymerase chain reaction, PCR (PCR
Protocols, a Guide to Methods and Applications, ed. Innis, Academic Press, NY, 1990
and PCR Strategies, ed. Innis, Academic Press, NY, 1995); ligase chain reaction
(LCR) (see, e.g., Wu, Genomics 4:560, 1989; Landegren, Science 241:1077, 1988;
Barringer, Gene 89:117, 1990); transcription amplification (see, e.g., Kwoh, Proc.
Natl. Acad. Sci. USA 86:1173, 1989); self-sustained sequence replication (see,
e.g., Guatelli, Proc. Natl. Acad. Sci. USA 87:1874, 1990); Q Beta replicase
amplification (see, e.g., Smith, J. Clin. Microbiol. 35:1477, 1997); automated Q-beta
replicase amplification assay (see, e.g., Burg, Mol. Cell. Probes 10:257, 1996) and
other RNA polymerase mediated techniques (e.g., NASBA, Cangene, Mississauga,
Ontario); see also Berger, Methods Enzymol. 152:307, 1987; Sambrook; Ausubel;
U.S. Patent Nos. 4,683,195 and 4,683,202; Sooknanan, Biotechnology 13:563, 1995.

Once amplified, the nucleic acids, either individually or as libraries, may be
cloned according to methods known in the art, if desired, into any of a variety of
vectors using routine molecular biological methods; methods for cloning in vitro
amplified nucleic acids are described in, e.g., U.S. Patent No. 5,426,039. To facilitate
cloning of amplified sequences, restriction enzyme sites can be "built into" the PCR
primer pair. For example, Pst I and BspE I sites were designed into the exemplary
primer pairs of the invention. These particular restriction sites have a sequence that,
when ligated, is "in-frame" with respect to the 7-membrane receptor "donor" coding
sequence into which they are spliced (the sensant-binding region-coding sequence is
internal to the 7-membrane polypeptide, thus, if it is desired that the construct be
translated downstream of a restriction enzyme splice site, out of frame results should
be avoided; this may not be necessary if the inserted sensant-binding domain
comprises substantially most of the transmembrane VII region). The primers can be
designed to retain the original sequence of the "donor" 7-membrane receptor (the Pst I
and BspE I sequences in the primers of the invention generate an insert that, when
ligated into the Pst I/BspE I cut vector, encodes residues found in the "donor" mouse
olfactory receptor M4 sequence). Alternatively, the primers can encode amino acid
residues that are conservative substitutions (e.g., hydrophobic for hydrophobic
residue, see above discussion) or functionally benign substitutions (e.g., do not
prevent plasma membrane insertion, cause cleavage by peptidase, cause abnormal
folding of receptor, and the like).

The primer pairs are designed to selectively amplify sensant-binding regions of
olfactory receptor proteins. These domain regions may vary for different sensants,
and more particularly odorants; thus, what may be a minimal binding region for one
sensant may be too limiting for a second potential
ligand. Thus, domain regions of different sizes comprising different domain
structures may be amplified; for example, transmembrane (TM) domains II through
VII, III through VII, III through VI or II through VI, or variations thereof (e.g., only a
subsequence of a particular domain, mixing the order of the domains, and the like), of
a 7-transmembrane sensory receptor.

As domain structures and sequences of many 7-membrane proteins, particularly
olfactory receptors, are known, the skilled artisan can readily select domain-flanking
and internal domain sequences as model sequences to design degenerate amplification
primer pairs. For example, a nucleic acid sequence encoding domain regions II
through VII can be generated by PCR amplification using a primer pair. To amplify a
nucleic acid comprising transmembrane domain I (TM I) sequence, a degenerate
primer can be designed from a nucleic acid that encodes the amino acid sequence
LFLLYL. Such a degenerate primer can be used to generate a binding domain
incorporating TM I through TM III, TM I through TM IV, TM I through TM V, TM I
through TM VI or TM I through TM VII.
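
To give a sense of why primers designed from a peptide such as LFLLYL are degenerate, the sketch below counts how many distinct nucleotide sequences encode a given peptide under the standard genetic code. The codon table is built from the standard code in T, C, A, G order; the function name is hypothetical, and the count is an upper bound on primer complexity, since practical designs often restrict the bases at some positions or use base analogs.

```python
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = {}
for codon, aa in zip(("".join(c) for c in product(BASES, repeat=3)), AMINO):
    CODONS.setdefault(aa, []).append(codon)


def coding_degeneracy(peptide):
    """Number of distinct nucleotide sequences that encode the peptide."""
    return prod(len(CODONS[aa]) for aa in peptide.upper())
```

Under these assumptions, LFLLYL can be encoded by 6 x 2 x 6 x 6 x 2 x 6 = 5184 different 18-nucleotide sequences, which is the kind of complexity that degenerate positions in a primer are meant to cover.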

To amplify a nucleic acid comprising a transmembrane domain III (TM III)
sequence, a degenerate primer (of at least about 17 residues) can be designed from a
nucleic acid that encodes the amino acid sequence M(A/G)(Y/F)DRYVAI (encoded
by a nucleic acid sequence such as
5'-ATGG(G/C)CT(A/T)TGACCG(C/A/T)T(A/T)(C/T)GT-3'). Such a degenerate
primer can be used to generate a binding domain incorporating TM III through TM IV,
TM III through TM V, TM III through TM VI or TM III through TM VII.

To amplify a transmembrane domain VI (TM VI) sequence, a degenerate
primer (of at least about 17 residues) can be designed from nucleic acid encoding an
amino acid sequence TC(G/A)SHL, encoded by a sequence such as
5'-AG(G/A)TGN(G/C)(T/A)N(G/C)C(G/A)CANGT-3'. Such a degenerate primer can
be used to generate a binding domain incorporating TM I through TM VI, TM II
through TM VI, TM III through TM VI or TM IV through TM VI.

Paradigms to design degenerate primer pairs are well known in the art. For
example, a COnsensus-DEgenerate Hybrid Oligonucleotide Primer (CODEHOP)
strategy computer program is accessible as http://blocks.fhcrc.org/codehop.html, and
is directly linked from the BlockMaker multiple sequence alignment site for hybrid
primer prediction beginning with a set of related protein sequences, such as known
olfactory receptor ligand-binding regions (see, e.g., Rose, Nucl. Acids Res. 26:1628,
1998; Singh, Biotechniques 24:318, 1998).

Means to synthesize oligonucleotide primer pairs are well known in the art.
"Natural" base pairs or synthetic base pairs can be used. For example, use of artificial
nucleobases offers a versatile approach to manipulate primer sequence and generate a
more complex mixture of amplification products. Various families of artificial
nucleobases are capable of assuming multiple hydrogen bonding orientations through
internal bond rotations to provide a means for degenerate molecular recognition.
Incorporation of these analogs into a single position of a PCR primer allows for
generation of a complex library of amplification products. See, e.g., Hoops, Nucleic
Acids Res. 25:4866, 1997. Nonpolar molecules can also be used to mimic the shape
of natural DNA bases. A non-hydrogen-bonding shape mimic for adenine can
replicate efficiently and selectively against a nonpolar shape mimic for thymine (see,
e.g., Morales, Nat. Struct. Biol. 5:950, 1998). For example, two degenerate bases can
be the pyrimidine base 6H,8H-3,4-dihydropyrimido[4,5-c][1,2]oxazin-7-one or the
purine base N6-methoxy-2,6-diaminopurine (see, e.g., Hill, Proc. Natl. Acad. Sci.
USA 95:4258, 1998). Exemplary degenerate primers of the invention incorporate the
nucleobase analog 5'-Dimethoxytrityl-N-benzoyl-2'-deoxy-Cytidine,3'-[(2-
cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite (the term "P" in the sequences, see
above). This pyrimidine analog hydrogen bonds with purines, including A and G
residues.

Exemplary primer pairs for amplification of olfactory receptor transmembrane
domains II through VII include:

(a) 5'-GGGGTCCGGAG(A/G)(C/G)(A/G)TA(A/G/T)AT(A/G/P)A(A/G/P)(A/G/P)GG-3' and
5'-GGGGCTGCAGACACC(A/C/G/T)ATGTA(C/T)(C/T)T(A/C/G/T)TT(C/T)(C/T)T-3'.

(b) 5'-GGGGTCCGGAG(A/G)(C/G)T(A/G)A(A/G/T)AT(A/G/P)A(A/G/P)(A/G/P)GG-3' and
5'-GGGGCTGCAGACACC(A/C/G/T)ATGTA(C/T)(C/T)T(A/C/G/T)TT(C/T)(C/T)T-3'.

(c) 5'-GGGGTCCGGAG(A/G)(C/G)T(A/G)A(A/G/T)AT(A/G/C/T)A(A/G/C/T)(A/G/C/T)GG-3' and
5'-GGGGCTGCAGACACC(A/C/G/T)ATGTA(C/T)(C/T)T(A/C/G/T)TT(C/T)(C/T)T-3'.
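
The parenthesized notation used in these primers, in which alternatives at a position are separated by slashes, can be parsed mechanically. The sketch below is an illustration only: the function names are hypothetical, the 5' and 3' markers are assumed to have been stripped from the input, and the analog "P" is simply treated as one more alternative at its position. It splits a primer into positions and counts how many distinct oligonucleotides the mixture contains.

```python
import re
from math import prod

# A position is either a parenthesized set such as (A/G/P) or a single letter.
TOKEN = re.compile(r"\(([^)]+)\)|([A-Z])")


def parse_primer(primer):
    """Return a list of positions, each a list of the allowed bases."""
    positions = []
    for group, single in TOKEN.findall(primer):
        positions.append(group.split("/") if group else [single])
    return positions


def primer_degeneracy(primer):
    """Number of distinct oligonucleotides represented by the primer."""
    return prod(len(options) for options in parse_primer(primer))


# Primer strings transcribed from pair (b) above, without the 5'/3' markers.
PST_PRIMER = "GGGGCTGCAGACACC(A/C/G/T)ATGTA(C/T)(C/T)T(A/C/G/T)TT(C/T)(C/T)T"
BSPE_PRIMER = "GGGGTCCGGAG(A/G)(C/G)T(A/G)A(A/G/T)AT(A/G/P)A(A/G/P)(A/G/P)GG"
```

Under this reading, the first primer is 30 positions long and represents 256 sequences, and the second represents 648. The fixed 5' portions contain the Pst I recognition sequence CTGCAG and the BspE I recognition sequence TCCGGA, respectively, consistent with the restriction sites said to be designed into the exemplary primers.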

Nucleic acids that encode ligand-binding regions of olfactory receptors are
generated by amplification (e.g., PCR) of appropriate nucleic acid sequences using
degenerate primer pairs. The amplified nucleic acid can be genomic DNA from any
cell or tissue or mRNA or cDNA derived from olfactory receptor-expressing cells,
e.g., olfactory neurons or olfactory epithelium.

Isolation from olfactory receptor-expressing cells is well known in the art
(cells naturally or inducibly expressing olfactory receptors can be used to
express the hybrid olfactory receptors of the invention to screen for potential odorants
and odorant effect on cell physiology, as described below). For example, cells can be
identified by olfactory marker protein (OMP), an abundant cytoplasmic protein
expressed almost exclusively in mature olfactory sensory neurons (see, e.g., Buiakova,
Proc. Natl. Acad. Sci. USA 93:9858, 1996). Shirley (Eur. J. Biochem. 32:485, 1983)
describes a rat olfactory preparation suitable for biochemical studies in vitro on
olfactory mechanisms. Cultures of adult rat olfactory receptor neurons are described
by Vargas (Chem. Senses 24:211, 1999). Because these cultured neurons exhibit
typical voltage-gated currents and are responsive to application of odorants, they can
also be used to express the hybrid olfactory receptors of the invention for odorant
screening (endogenous olfactory receptor can be initially blocked, if desired, by, e.g.,
antisense, knockout, and the like). U.S. Patent No. 5,869,266 describes culturing
human olfactory neurons for neurotoxicity tests and screening. Murrell (J. Neurosci.
19:8260, 1999) describes differentiated olfactory receptor-expressing cells in culture
that respond to odorants, as measured by an influx of calcium.

Hybrid protein-coding sequences comprising nucleic acids encoding sensory receptors
fused to the translocation sequences described herein may be constructed. Also
provided are hybrid receptors comprising the translocation motifs and ligand-binding
domains of sensory receptors. These nucleic acid sequences can be operably linked to
transcriptional or translational control elements, e.g., transcription and translation
initiation sequences, promoters and enhancers, transcription and translation
terminators, polyadenylation sequences, and other sequences useful for transcribing
DNA into RNA. In construction of recombinant expression cassettes, vectors, and
transgenics, a promoter fragment can be employed to direct expression of the
desired nucleic acid in all tissues. Olfactory cell-specific transcriptional elements can
also be used to express the fusion polypeptide receptor, including, e.g., a 6.7 kb region
upstream of the M4 olfactory receptor coding region. This region was sufficient to
direct expression in olfactory epithelium with wild type zonal restriction and
distributed neuronal expression for endogenous olfactory receptors (Qasba, J.
Neurosci. 18:227, 1998). Receptor genes are normally expressed in a small subset of
neurons throughout a zonally restricted region of the sensory epithelium. The
transcriptional or translational control elements can be isolated from natural sources,
obtained from such sources as ATCC or GenBank libraries, or prepared by synthetic
or recombinant methods.

Fusion proteins, either having C-terminal or, more preferably, N-terminal
translocation sequences, may also comprise the translocation motif described herein.
However, these fusion proteins can also comprise additional elements for, e.g., protein
detection, purification, or other applications. Detection and purification facilitating
domains include, e.g., metal chelating peptides such as polyhistidine tracts or
histidine-tryptophan modules or other domains that allow purification on immobilized
metals; maltose binding protein; protein A domains that allow purification on
immobilized immunoglobulin; or the domain utilized in the FLAGS extension/affinity
purification system (Immunex Corp, Seattle WA).

The inclusion of cleavable linker sequences such as Factor Xa (see, e.g.,
Ottavi, Biochimie 80:289, 1998), subtilisin protease recognition motif (see, e.g.,
Polyak, Protein Eng. 10:615, 1997), enterokinase (Invitrogen, San Diego, CA), and
the like, between the translocation domain (for efficient plasma membrane
expression) and the rest of the newly translated polypeptide may be useful to facilitate
purification. For example, one construct can include a nucleic acid sequence
encoding a polypeptide linked to six histidine residues followed by a thioredoxin, an
enterokinase cleavage site (see, e.g., Williams, Biochemistry 34:1787, 1995), and an
amino terminal translocation domain. The histidine residues facilitate detection and
purification while the enterokinase cleavage site provides a means for purifying the
desired protein(s) from the remainder of the fusion protein. Technology pertaining to
vectors encoding fusion proteins and application of fusion proteins is well described
in the scientific and patent literature (see, e.g., Kroll, DNA Cell. Biol. 12:441, 1993).

Expression vectors, either as individual expression vectors or as libraries of
expression vectors, comprising the olfactory binding domain-encoding sequences may
be introduced into a genome or into the cytoplasm or a nucleus of a cell and expressed
by a variety of conventional techniques, well described in the scientific and patent
literature. See, e.g., Roberts, Nature 328:731, 1987; Berger supra; Schneider, Protein
Expr. Purif. 6435:10, 1995; Sambrook; Tijssen; Ausubel. Product information from
manufacturers of biological reagents and experimental equipment also provides
information regarding known biological methods. The vectors can be isolated from
natural sources, obtained from such sources as ATCC or GenBank libraries, or
prepared by synthetic or recombinant methods.

The nucleic acids can be expressed in expression cassettes, vectors or viruses
which are stably or transiently expressed in cells (e.g., episomal expression systems).
Selection markers can be incorporated into expression cassettes and vectors to confer
a selectable phenotype on transformed cells and sequences. For example, selection
markers can code for episomal maintenance and replication such that integration into
the host genome is not required. For example, the marker may encode antibiotic
resistance (e.g., chloramphenicol, kanamycin, G418, bleomycin, hygromycin) or
herbicide resistance (e.g., chlorosulfuron or Basta) to permit selection of those cells
transformed with the desired DNA sequences (see, e.g., Blondelet-Rouault, Gene
190:315, 1997; Aubrecht, J. Pharmacol. Exp. Ther. 281:992, 1997). Because
selectable marker genes conferring resistance to substrates like neomycin or
hygromycin can only be utilized in tissue culture, chemoresistance genes are also used
as selectable markers in vitro and in vivo.

A chimeric nucleic acid sequence may encode a sensant-binding domain
within any 7-transmembrane polypeptide. Seven-transmembrane receptors belong to
a superfamily of transmembrane (TM) proteins that traverse the plasma membrane
seven times. Each of the seven domains spans the plasma
membrane (TM I to TM VII). Because 7-transmembrane receptor polypeptides have
similar primary sequences and secondary and tertiary structures, structural domains
(e.g., TM domains) can be readily identified by sequence analysis. For example,
homology modeling, Fourier analysis and helical periodicity detection can identify and
characterize the seven domains within a 7-transmembrane receptor sequence. Fast
Fourier Transform (FFT) algorithms can be used to assess the dominant periods that
characterize profiles of the hydrophobicity and variability of analyzed sequences. To
predict TM domains and their boundaries and topology, a "neural network algorithm"
by "PHD server" can be used, as done by Pilpel, Protein Science 8:969, 1999; Rost,
Protein Sci. 4:521, 1995. Periodicity detection enhancement and alpha helical
periodicity index can be done as by, e.g., Donnelly, Protein Sci. 2:55-70 (1993).
Other alignment and modeling algorithms are well known in the art, see, e.g., Peitsch,
Receptors Channels 4:161, 1996; Cronet, Protein Eng. 6:59 (1993) (homology and
"discover modeling"); http://bioinfo.weizmann.ac.il/.

The library sequences include receptor sequences that correspond to TM
ligand-binding domains, including, e.g., TM II to VII, TM II to VI, TM III to VII, and
TM III to VI, that have been amplified (e.g., PCR) from mRNA of or cDNA derived
from, e.g., olfactory receptor-expressing neurons or genomic DNA.

Libraries of sensory receptor ligand-binding TM domain sequences can
include various TM domains or variations thereof, as described above. These
sequences can be derived from any 7-transmembrane receptor. Because these
polypeptides have similar primary sequences and secondary and tertiary structures, the
seven domains can be identified by various analyses well known in the art, including,
e.g., homology modeling, Fourier analysis and helical periodicity (see, e.g., Pilpel
supra), as described above. Using this information, sequences flanking the seven
domains can be identified and used to design degenerate primers for amplification of
various combinations of TM regions and subsequences.
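
The analyses named above are not spelled out in this text. Purely as an illustration of the idea that hydrophobic, membrane-spanning stretches can be located from the primary sequence, the sketch below uses a simpler, substituted technique: a sliding-window average of Kyte-Doolittle hydropathy values. It is not the Fourier or neural-network method cited above; the window size, threshold, and function names are assumptions chosen for the example.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydropathy_profile(seq, window=19):
    """Mean hydropathy of each full window; entry k is centred on residue k + window // 2."""
    half = window // 2
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i - half:i + half + 1]) / window
        for i in range(half, len(seq) - half)
    ]

def candidate_tm_segments(seq, window=19, threshold=1.6):
    """Return 0-based inclusive (start, end) ranges of window centres scoring >= threshold.

    The window and threshold are illustrative assumptions, not values from this text.
    """
    half = window // 2
    segments, start = [], None
    for offset, value in enumerate(hydropathy_profile(seq, window)):
        centre = offset + half
        if value >= threshold:
            if start is None:
                start = centre
        elif start is not None:
            segments.append((start, centre - 1))
            start = None
    if start is not None:
        segments.append((start, len(seq) - half - 1))
    return segments

if __name__ == "__main__":
    # Synthetic sequence: a hydrophobic stretch flanked by charged residues.
    toy = "D" * 30 + "L" * 25 + "D" * 30
    print(candidate_tm_segments(toy))
```

The ranges returned are window centres, so actual domain boundaries would extend beyond them; flanking sequences for primer design would be chosen outside the predicted segments.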

The present invention includes not only the DNA and proteins having the
specified amino acid sequences, but also DNA fragments, particularly fragments of,
for example, 40, 60, 80, 100, 150, 200, or 250 nucleotides, or more, as well as protein
fragments of, for example, 10, 20, 30, 50, 70, 100, or 150 amino acids, or more.

Also contemplated are chimeric proteins, comprising at least 10, 20, 30, 50,
70, 100, or 150 amino acids, or more, of at least one of the sensory receptors
described herein, coupled to additional amino acids representing all or part of another
G protein receptor, preferably a member of the 7-transmembrane superfamily. These
chimerae can be made from the instant receptors and a G protein receptor described
herein, or they can be made by combining two or more of the present proteins. In one
preferred embodiment, one portion of the chimera corresponds to and is derived from
one or more of the domains of the 7-transmembrane protein described herein, and the
remaining portion or portions come from another G protein-coupled receptor.
Chimeric receptors are well known in the art, and the techniques for creating them and
the selection and boundaries of domains or fragments of G protein-coupled receptors
for incorporation therein are also well known. Thus, this knowledge of those skilled
in the art can readily be used to create such chimeric receptors. The use of such
chimeric receptors can provide, for example, an olfactory selectivity characteristic of
one of the receptors specifically disclosed herein, coupled with the signal transduction
characteristics of another receptor, such as a well known receptor used in prior art
assay systems.

For example, a domain such as a ligand binding domain, an extracellular
domain, a transmembrane domain (e.g., one comprising seven transmembrane regions
and corresponding extracellular and cytosolic loops), the transmembrane domain and
a cytoplasmic domain, an active site, a subunit association region, etc., can be
covalently linked to a heterologous protein. For instance, an extracellular domain can
be linked to a heterologous GPCR transmembrane domain, or a heterologous GPCR
extracellular domain can be linked to a transmembrane domain. Other heterologous
proteins of choice can include, e.g., green fluorescent protein, β-gal, glutamate
receptor, and the rhodopsin presequence.

Polymorphic variants, alleles, and interspecies homologs that are substantially
identical to a sensory receptor disclosed herein can be isolated using the nucleic acid
probes described above. Alternatively, expression libraries can be used to isolate
sensory receptors and polymorphic variants, alleles, and interspecies homologs
thereof, by detecting expressed homologs immunologically with antisera or purified
antibodies made against a sensory receptor-derived polypeptide, which also recognize
and selectively bind to the sensory receptor homolog.

Also within the scope of the invention are host cells for expressing the sensory
receptors, fragments, or variants of the invention. To obtain high levels of expression
of a cloned gene or nucleic acid, such as cDNAs encoding the sensory receptors,
fragments, or variants thereof, the nucleic acid sequence of interest is subcloned into
an expression vector that contains a strong promoter to direct transcription, a
transcription/translation terminator, and, if for a nucleic acid encoding a protein, a
ribosome binding site for translational initiation. Suitable prokaryotic and eukaryotic
expression systems are well known in the art and described, e.g., in Sambrook et al.

Any of the well known procedures for introducing foreign nucleotide
sequences into host cells may be used. These include the use of calcium phosphate
transfection, polybrene, protoplast fusion, electroporation, liposomes, microinjection,
plasmid vectors, viral vectors and any of the other well known methods for
introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic
material into a host cell (see, e.g., Sambrook et al.). It is only necessary that the
particular genetic engineering procedure used be capable of successfully introducing
at least one gene into the host cell capable of expressing the olfactory receptor,
fragment, or variant of interest.

After the expression vector is introduced into the cells, the transfected cells are
cultured under conditions favoring expression of the receptor, fragment, or variant of
interest, which is then recovered from the culture using standard techniques.
Examples of such techniques are well known in the art. See, e.g., WO 00/06593,
which is incorporated by reference in a manner consistent with this disclosure.

D. Immunological Detection of Sensory Receptor Polypeptides

In addition to the detection of sensory receptor genes and gene expression
using nucleic acid hybridization technology, one can also use immunoassays to detect
sensory receptors, e.g., to identify olfactory receptor cells, and variants of sensory
receptor family members. Immunoassays can be used to qualitatively or quantitatively
analyze the sensory receptors. A general overview of the applicable technology can
be found in Harlow & Lane, Antibodies: A Laboratory Manual (1988).

1. Antibodies to sensory receptor family members

Methods of producing polyclonal and monoclonal antibodies that react
specifically with a sensory receptor family member are known to those of skill in the
art (see, e.g., Coligan, Current Protocols in Immunology, 1991; Goding, Monoclonal
Antibodies: Principles and Practice, 2d ed., 1986; Harlow & Lane, supra; and Kohler
& Milstein, Nature, 256:495, 1975). Such techniques include antibody preparation by
selection of antibodies from libraries of recombinant antibodies in phage or similar
vectors, as well as preparation of polyclonal and monoclonal antibodies by
immunizing rabbits or mice (see, e.g., Huse et al., Science, 246:1275, 1989; Ward et
al., Nature, 341:544, 1989).

A number of sensory receptor-comprising immunogens may be used to
produce antibody specifically reactive with a sensory receptor family member. For
example, a recombinant sensory receptor protein, or an antigenic fragment thereof,
can be isolated as described herein. Suitable antigenic regions include, e.g., the
conserved motifs that are used to identify members of the sensory receptor family.
Recombinant proteins can be expressed in eukaryotic or prokaryotic cells as described
above, and purified as generally described above. Recombinant protein is the
preferred immunogen for the production of monoclonal or polyclonal antibodies.
Alternatively, a synthetic peptide derived from the sequences disclosed herein and
conjugated to a carrier protein can be used as an immunogen. Naturally occurring
protein may also be used either in pure or impure form. The product is then injected
into an animal capable of producing antibodies. Either monoclonal or polyclonal
antibodies may be generated, for subsequent use in immunoassays to measure the
protein.

Methods of production of polyclonal antibodies are known to those of skill in
the art. Mice, hamsters, rats, guinea pigs, rabbits, goats, or chickens are immunized
with the protein using an adjuvant (e.g., Freund's adjuvant) and a standard
immunization protocol with periodic boosts. The animal's immune response to the
immunogen preparation is monitored by taking test bleeds and determining the titer of
reactivity to the sensory receptor. When appropriately high titers of antibody to the
immunogen are obtained, blood is collected from the animal and antisera are prepared.
Further fractionation of the antisera to enrich for antibodies reactive to the protein can
be done if desired (see Harlow & Lane, supra).

Monoclonal antibodies may be obtained by various techniques familiar to
those skilled in the art. Briefly, spleen cells from an animal immunized with a desired
antigen are immortalized, commonly by fusion with a myeloma cell (see Kohler &
Milstein, Eur. J. Immunol., 6:511, 1976). Alternative methods of immortalization
include transformation with Epstein Barr Virus, oncogenes, retroviruses, or other
methods well known in the art. Colonies arising from single clones of immortalized
cells are screened for production of antibodies of the desired specificity and affinity
for the antigen, and yield of the monoclonal antibodies produced by such cells may be
enhanced by various techniques, including injection into the peritoneal cavity of a
vertebrate host. Alternatively, one may isolate DNA sequences which encode a
monoclonal antibody or a binding fragment thereof by screening a DNA library from
human B cells according to the general protocol outlined by Huse et al., Science,
246:1275, 1989.

Monoclonal antibodies or polyclonal sera are collected and titered against
antigen in an immunoassay, for example, a solid phase immunoassay with the antigen
immobilized on a solid support. Typically, polyclonal antisera with a titer of 10⁴ or
greater are selected and tested for their cross reactivity against non-sensory receptor
proteins, or even other sensory receptor family members or other related proteins from
other organisms, using a competitive binding immunoassay. Specific polyclonal
antisera and monoclonal antibodies will usually bind with a Kd of at least about 0.1
mM, more usually at least about 1 µM, optionally at least about 0.1 µM or better, and
optionally 0.01 µM or better.

Once sensory receptor family member specific antibodies are available,
individual sensory receptor proteins can be detected by a variety of immunoassay
methods. For a review of immunological and immunoassay procedures, see Basic
and Clinical Immunology (Stites & Terr eds., 7th ed., 1991). Moreover, the
immunoassays of the present invention can be performed in any of several
configurations, which are reviewed extensively in Enzyme Immunoassay (Maggio, ed.,
1980); and Harlow & Lane, supra.

2. Immunological binding assays

Sensory receptor proteins can be detected and/or quantified using any of a
number of well recognized immunological binding assays (see, e.g., U.S. Patent Nos.
4,366,241; 4,376,110; 4,517,288; and 4,837,168). For a review of the general
immunoassays, see also Methods in Cell Biology: Antibodies in Cell Biology, volume
37 (Asai, ed. 1993); Basic and Clinical Immunology (Stites & Terr, eds., 7th ed.,
1991). Immunological binding assays (or immunoassays) typically use an antibody
that specifically binds to a protein or antigen of choice (in this case a sensory receptor
family member or an antigenic subsequence thereof). The antibody (e.g., anti-sensory
receptor) may be produced by any of a number of means well known to those of skill
in the art and as described above.

Immunoassays also often use a labeling agent to specifically bind to and label
the complex formed by the antibody and antigen. The labeling agent may itself be one
of the moieties comprising the antibody/antigen complex. Thus, the labeling agent
may be a labeled sensory receptor polypeptide or a labeled anti-sensory receptor
antibody. Alternatively, the labeling agent may be a third moiety, such as a secondary
antibody, that specifically binds to the antibody/sensory receptor complex (a
secondary antibody is typically specific to antibodies of the species from which the
first antibody is derived). Other proteins capable of specifically binding
immunoglobulin constant regions, such as protein A or protein G, may also be used as
the labeling agent. These proteins exhibit a strong non-immunogenic reactivity with
immunoglobulin constant regions from a variety of species (see, e.g., Kronval et al., J.
Immunol., 111:1401, 1973; Akerstrom et al., J. Immunol., 135:2589, 1985). The
labeling agent can be modified with a detectable moiety, such as biotin, to which
another molecule can specifically bind, such as streptavidin. A variety of detectable
moieties are well known to those skilled in the art.

Throughout the assays, incubation and/or washing steps may be required after
each combination of reagents. Incubation steps can vary from about 5 seconds to
several hours, optionally from about 5 minutes to about 24 hours. However, the
incubation time will depend upon the assay format, antigen, volume of solution,
concentrations, and the like. Usually, the assays will be carried out at ambient
temperature, although they can be conducted over a range of temperatures, such as
10°C to 40°C.

a. Non-competitive assay formats

Immunoassays for detecting a sensory receptor protein in a sample may be
either competitive or noncompetitive. Noncompetitive immunoassays directly
measure the amount of antigen. In one preferred "sandwich" assay, for example, the
anti-sensory receptor antibodies are bound directly to a solid substrate on which they
are immobilized. These immobilized antibodies then capture the sensory receptor
protein present in the test sample. The sensory receptor protein thus immobilized is
then bound by a labeling agent, such as a second anti-sensory receptor antibody
bearing a label. Alternatively, the second antibody may lack a label, but it may, in
turn, be bound by a labeled third antibody specific to antibodies of the species from
which the second antibody is derived. The second or third antibody is typically
modified with a detectable moiety, such as biotin, to which another molecule
specifically binds, e.g., streptavidin, to provide a detectable moiety.

b. Competitive assay formats

In competitive assays, the amount of sensory receptor protein present in the
sample is measured indirectly by measuring the amount of a known, added
(exogenous) sensory receptor protein displaced (competed away) from an anti-sensory
receptor antibody by the unknown sensory receptor protein present in a sample. In
one competitive assay, a known amount of sensory receptor protein is added to a
sample and the sample is then contacted with an antibody that specifically binds to the
sensory receptor. The amount of exogenous sensory receptor protein bound to the
antibody is inversely proportional to the concentration of sensory receptor protein
present in the sample. In a particularly preferred embodiment, the antibody is
immobilized on a solid substrate. The amount of sensory receptor protein bound to
the antibody may be determined either by measuring the amount of sensory receptor
protein present in a sensory receptor/antibody complex, or alternatively by measuring
the amount of remaining uncomplexed protein. The amount of sensory receptor
protein may be detected by providing a labeled sensory receptor molecule.

A hapten inhibition assay is another preferred competitive assay. In this assay
the known sensory receptor protein is immobilized on a solid substrate. A known
amount of anti-sensory receptor antibody is added to the sample, and the sample is
then contacted with the immobilized sensory receptor. The amount of anti-sensory
receptor antibody bound to the known immobilized sensory receptor protein is
inversely proportional to the amount of sensory receptor protein present in the sample.
Again, the amount of immobilized antibody may be detected by detecting either the
immobilized fraction of antibody or the fraction of the antibody that remains in
solution. Detection may be direct where the antibody is labeled or indirect by the
subsequent addition of a labeled moiety that specifically binds to the antibody as
described above.

c. Cross-reactivity determinations

Immunoassays in the competitive binding format can also be used for
cross-reactivity determinations. For example, a protein at least partially encoded by the
nucleic acid sequences disclosed herein can be immobilized to a solid support.
Proteins (e.g., sensory receptor proteins and homologs) are added to the assay that
compete for binding of the antisera to the immobilized antigen. The ability of the
added proteins to compete for binding of the antisera to the immobilized protein is
compared to the ability of the sensory receptor polypeptide encoded by the nucleic
acid sequences disclosed herein to compete with itself. The percentage
cross-reactivity for the above proteins is calculated, using standard calculations. Those
antisera with less than 10% cross-reactivity with each of the added proteins listed
above are selected and pooled. The cross-reacting antibodies are optionally removed
from the pooled antisera by immunoabsorption with the added proteins considered,
e.g., distantly related homologs. In addition, peptides comprising amino acid
sequences representing conserved motifs that are used to identify members of the
sensory receptor family can be used in cross-reactivity determinations.

The immunoabsorbed and pooled antisera are then used in a competitive
binding immunoassay as described above to compare a second protein, thought to be
perhaps an allele or polymorphic variant of a sensory receptor family member, to the
immunogen protein (i.e., sensory receptor protein encoded by the nucleic acid
sequences disclosed herein). In order to make this comparison, the two proteins are
each assayed at a wide range of concentrations and the amount of each protein
required to inhibit 50% of the binding of the antisera to the immobilized protein is
determined. If the amount of the second protein required to inhibit 50% of binding is
less than 10 times the amount of the protein encoded by nucleic acid sequences
disclosed herein required to inhibit 50% of binding, then the second protein is said to
specifically bind to the polyclonal antibodies generated to a sensory receptor
immunogen.
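
The comparison described above can be sketched numerically. The Python example below estimates the amount of competitor giving 50% inhibition by interpolating between measured points on a logarithmic concentration axis, then applies the "less than 10 times" criterion. The interpolation scheme, the percent cross-reactivity formula (reference amount divided by test amount, times 100), the function names, and the data are all assumptions made for illustration; the text itself refers only to "standard calculations."

```python
import math

def ic50(concentrations, percent_bound):
    """Competitor concentration at which antisera binding falls to 50%.

    Assumes concentrations are ascending and positive, and that percent_bound
    decreases as competitor is added. Interpolates linearly in log10(concentration).
    """
    points = list(zip(concentrations, percent_bound))
    for (c_lo, b_lo), (c_hi, b_hi) in zip(points, points[1:]):
        if b_lo >= 50.0 >= b_hi and b_lo != b_hi:
            frac = (b_lo - 50.0) / (b_lo - b_hi)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    raise ValueError("binding curve does not cross 50%")

def percent_cross_reactivity(ic50_reference, ic50_test):
    """One common convention (an assumption here): 100 * reference / test."""
    return 100.0 * ic50_reference / ic50_test

def binds_specifically(ic50_reference, ic50_test, fold_limit=10.0):
    """True if the test protein needs less than fold_limit times the reference amount."""
    return ic50_test < fold_limit * ic50_reference

if __name__ == "__main__":
    conc = [1, 10, 100, 1000]            # hypothetical competitor amounts
    reference_curve = [90, 50, 10, 2]    # hypothetical % binding, immunogen protein
    second_curve = [95, 75, 40, 5]       # hypothetical % binding, second protein
    ref, test = ic50(conc, reference_curve), ic50(conc, second_curve)
    print(ref, test, binds_specifically(ref, test))
```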

Antibodies raised against sensory receptor conserved motifs can also be used
to prepare antibodies that specifically bind only to GPCRs of the sensory receptor
family, but not to GPCRs from other families. Polyclonal antibodies that specifically
bind to a particular member of the sensory receptor family can be made by
subtracting out cross-reactive antibodies using other sensory receptor family members.
Species-specific polyclonal antibodies can be made in a similar way. For example,
antibodies specific to human T2R01 can be made by subtracting out antibodies that
are cross-reactive with orthologous sequences, e.g., rat OLFR1 or mouse OLFR1.

d. Other assay formats

Western blot (immunoblot) analysis is used to detect and quantify the presence
of sensory receptor protein in the sample. The technique generally comprises
separating sample proteins by gel electrophoresis on the basis of molecular weight,
transferring the separated proteins to a suitable solid support (such as a nitrocellulose
filter, a nylon filter, or derivatized nylon filter), and incubating the sample with the
antibodies that specifically bind the sensory receptor protein. The anti-sensory
receptor polypeptide antibodies specifically bind to the sensory receptor polypeptide
on the solid support. These antibodies may be directly labeled or alternatively may be
subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse
antibodies) that specifically bind to the anti-sensory receptor antibodies.

Other assay formats include liposome immunoassays (LIA) using liposomes
designed to bind specific molecules (e.g., antibodies) and release encapsulated
reagents or markers. The released chemicals are then detected according to standard
techniques (see Monroe et al., Amer. Clin. Prod. Rev., 5:34, 1986).

e. Reduction of non-specific binding

One of skill in the art will appreciate that it is often desirable to minimize
non-specific binding in immunoassays. Particularly, where the assay involves an
antigen or antibody immobilized on a solid substrate, it is desirable to minimize the
amount of non-specific binding to the substrate. Means of reducing such non-specific
binding are well known to those of skill in the art. Typically, this technique involves
coating the substrate with a proteinaceous composition. In particular, protein
compositions such as bovine serum albumin (BSA), nonfat powdered milk, and
gelatin are widely used, with powdered milk being most preferred.

f. Labels

The particular label or detectable group used in the assay is not a critical aspect
of the invention, as long as it does not significantly interfere with the specific binding
of the antibody used in the assay. The detectable group can be any material having a
detectable physical or chemical property. Such detectable labels have been
well-developed in the field of immunoassays and, in general, most any label useful in such
methods can be applied to the present invention. Thus, a label is any composition
detectable by spectroscopic, photochemical, biochemical, immunochemical,
electrical, optical or chemical means. Useful labels in the present invention include
magnetic beads (e.g., DYNABEADS™), fluorescent dyes (e.g., fluorescein
isothiocyanate, Texas red, rhodamine, and the like), radiolabels (e.g., ³H, ¹²⁵I, ³⁵S, ¹⁴C,
or ³²P), enzymes (e.g., horseradish peroxidase, alkaline phosphatase and others
commonly used in an ELISA), and colorimetric labels such as colloidal gold or
colored glass or plastic beads (e.g., polystyrene, polypropylene, latex, etc.).

The label may be coupled directly or indirectly to the desired component of the
assay according to methods well known in the art. As indicated above, a wide variety
of labels may be used, with the choice of label depending on sensitivity required, ease
of conjugation with the compound, stability requirements, available instrumentation,
and disposal provisions.

Non-radioactive labels are often attached by indirect means. Generally, a
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then
binds to another molecule (e.g., streptavidin), which is either inherently
detectable or covalently bound to a signal system, such as a detectable enzyme, a
fluorescent compound, or a chemiluminescent compound. The ligands and their
targets can be used in any suitable combination with antibodies that recognize a
sensory receptor protein, or secondary antibodies that recognize anti-sensory receptor.

The molecules can also be conjugated directly to signal generating
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of interest
as labels will primarily be hydrolases, particularly phosphatases, esterases and
glycosidases, or oxidoreductases, particularly peroxidases. Fluorescent compounds include
fluorescein and its derivatives, rhodamine and its derivatives, dansyl, umbelliferone,
etc. Chemiluminescent compounds include luciferin, and
2,3-dihydrophthalazinediones, e.g., luminol. For a review of various labeling or
signal producing systems that may be used, see U.S. Patent No. 4,391,904.

Methods for detecting labels are well known. Thus, for example, where the
label is a radioactive label, it may be detected using a scintillation counter or with
photographic film as in autoradiography. Where the label is a fluorescent label, it may
be detected by exciting the fluorochrome with the appropriate wavelength of light
and detecting the resulting fluorescence. The fluorescence may be detected visually,
with photographic film, or using electronic detectors such as charge coupled devices
(CCDs) or photomultipliers. Similarly, enzymatic labels may be detected by
providing the appropriate substrates for the enzyme and detecting the resulting
reaction product. Finally, simple colorimetric labels may be detected simply by
observing the color associated with the label. Thus, in various dipstick assays,
conjugated gold often appears pink, while various conjugated beads appear the color
of the bead.

Some assay formats do not require the use of labeled components. For
instance, agglutination assays can be used to detect the presence of the target
antibodies. In this case, antigen-coated particles are agglutinated by samples
comprising the target antibodies. In this format, none of the components need be
labeled and the presence of the target antibody is detected by simple visual inspection.

E. Detection of Olfactory Modulators

Methods and compositions for determining whether a test compound
specifically binds to a mammalian sensory, and more particularly, olfactory receptor
of the invention, both in vitro and in vivo are described below, as are methods and
compositions for determining whether a test compound is neurotoxic to an olfactory
neuron expressing an olfactory transmembrane receptor polypeptide. Any aspect of
cell physiology can be monitored to assess the effect of sensant-binding to a
naturally-occurring or chimeric olfactory receptor. These assays may be performed on intact
cells expressing an olfactory receptor, on permeabilized cells or on membrane
fractions produced by standard methods.

Olfactory receptors are normally located on the specialized cilia of olfactory
neurons. These receptors bind odorants and initiate the transduction of chemical
stimuli into electrical signals. An activated or inhibited G-protein will in turn alter the
properties of target enzymes, channels, and other effector proteins. The classic
examples are the activation of cGMP phosphodiesterase by transducin in the visual
system, adenylate cyclase by the stimulatory G-protein, phospholipase C by Gq and
other cognate G-proteins, and modulation of diverse channels by Gi and other
G-proteins. Downstream consequences can also be examined, such as generation of
diacyl glycerol and IP3 by phospholipase C, and in turn, calcium mobilization by
IP3.

The sensory receptor protein of the assay will typically be selected from a
natural polypeptide or conservatively modified variant thereof. Generally, the amino
acid sequence identity will be at least 75%, 85%, 90%, 95%, or 99%. Optionally, the
polypeptide of the assays can comprise a domain of a sensory receptor protein, such as
an extracellular domain, transmembrane region, transmembrane domain, cytoplasmic
domain, ligand binding domain, subunit association domain, active site, and the like.
Either the sensory receptor protein or a domain thereof can be covalently linked to a
heterologous protein to create a chimeric protein used in the assays described herein.
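
As an illustration of the identity thresholds mentioned above, the following is a minimal Python sketch of a percent-identity check on two sequences that have already been aligned. The function names, the gap character, and the convention of ignoring gapped columns are assumptions made for this example; the specification does not prescribe a particular alignment method or identity formula.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which both sequences have a residue.

    Assumes the two sequences were aligned beforehand (equal length, gaps as `gap`).
    Skipping gapped columns is one possible convention, chosen here for simplicity.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # ignore columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 75.0) -> bool:
    """True if the percent identity is at least `threshold` (e.g., 75, 85, 90, 95, 99)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical toy sequences, for illustration only.
    print(percent_identity("MKTAY-LV", "MKSAYGLV"))
    print(meets_identity_threshold("MKTAY-LV", "MKSAYGLV", threshold=75.0))
```

In practice the alignment itself would come from an established alignment program, and the choice of denominator (alignment length, shorter sequence, or ungapped columns) should be stated alongside any reported identity value.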

Modulators of sensory receptor activity can be tested using sensory receptor
polypeptides as described above, either recombinant or naturally occurring. Protein
can be isolated, expressed in a cell, expressed in a membrane derived from a cell,
expressed in tissue or in an animal, either recombinant or naturally occurring.
Modulation can be tested using one of the in vitro or in vivo assays described herein.

1. In vitro binding assays

Sensory perception can also be examined in vitro with soluble or solid state
reactions, using a full-length sensory receptor-GPCR or a chimeric molecule such as
an extracellular domain or transmembrane region, or combination thereof, of a
sensory receptor covalently linked to a heterologous signal transduction domain, or a
heterologous extracellular domain and/or transmembrane region covalently linked to
the transmembrane and/or cytoplasmic domain of a sensory receptor. Furthermore,
ligand-binding domains of the protein of interest can be used in vitro in soluble or
solid state reactions to assay for ligand binding. In numerous embodiments, a
chimeric receptor will be made that comprises all or part of a sensory receptor
polypeptide, as well as an additional sequence that facilitates the localization of the
sensory receptor to the membrane, such as a rhodopsin, e.g., an N-terminal fragment
of a rhodopsin protein.

Ligand binding to a sensory receptor protein, a domain, or chimeric protein
can be tested in solution, in a bilayer membrane, attached to a solid phase, in a lipid
monolayer, or in vesicles. Binding of a modulator can be tested using, e.g., changes in
spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index),
hydrodynamic (e.g., shape), chromatographic, or solubility properties. Some sensory
receptors have large (e.g., approximately 600 amino acid residues) extracellular
N-terminal segments. These N-terminal segments are thought to form ligand-binding
domains, and are therefore useful in biochemical assays to identify sensory receptor
agonists and antagonists. Similar assays have been used with other GPCRs, such as
the metabotropic glutamate receptors (e.g., Han & Hampson, J. Biol. Chem.
274:10008, 1999). These assays might involve displacing a radioactively or
fluorescently labeled ligand, and measuring changes in intrinsic fluorescence or
changes in proteolytic susceptibility, etc.

Receptor-G-protein interactions can also be examined. For example, binding
of the G-protein to the receptor or its release from the receptor can be examined. For
example, in the absence of GTP, an activator will lead to the formation of a tight
complex of a G protein (all three subunits) with the receptor. This complex can be
detected in a variety of ways, as noted above. Such an assay can be modified to
search for inhibitors, e.g., by adding an activator to the receptor and G protein in the
absence of GTP, which form a tight complex, and then screening for inhibitors by
looking at dissociation of the receptor-G protein complex. In the presence of GTP,
release of the alpha subunit of the G protein from the other two G protein subunits
serves as a criterion of activation.

An activated or inhibited G-protein will in turn alter the properties of target
enzymes, channels, and other effector proteins. The classic examples are the
activation of cGMP phosphodiesterase by transducin in the visual system, adenylate
cyclase by the stimulatory G-protein, phospholipase C by Gq and other cognate G
proteins, and modulation of diverse channels by Gi and other G proteins.
Downstream consequences can also be examined, such as generation of diacyl glycerol
and IP3 by phospholipase C, and in turn, calcium mobilization by IP3.

2. Fluorescence Polarization Assays

In another embodiment, Fluorescence Polarization ("FP") based assays may be
used to detect and monitor sensant binding. Fluorescence polarization is a versatile
laboratory technique for measuring equilibrium binding, nucleic acid hybridization,
and enzymatic activity. Fluorescence polarization assays are homogeneous in that
they do not require a separation step such as centrifugation, filtration,
chromatography, precipitation, or electrophoresis. These assays are done in real time,
directly in solution and do not require an immobilized phase. Polarization values can
be measured repeatedly and after the addition of reagents since measuring the
polarization is rapid and does not destroy the sample. Generally, this technique can be used
to measure polarization values of fluorophores from low picomolar to micromolar
levels. This section describes how fluorescence polarization can be used in a simple
and quantitative way to measure the binding of odorants to the sensory receptors of
the invention.

When a fluorescently labeled molecule is excited with plane polarized light, it
emits light that has a degree of polarization that is inversely proportional to its
molecular rotation. Large fluorescently labeled molecules remain relatively stationary
during the excited state (4 nanoseconds in the case of fluorescein) and the
polarization of the light remains relatively constant between excitation and emission.
Small fluorescently labeled molecules rotate rapidly during the excited state and the
polarization changes significantly between excitation and emission. Therefore, small
molecules have low polarization values and large molecules have high polarization
values. For example, a single-stranded fluorescein-labeled oligonucleotide has a
relatively low polarization value but when it is hybridized to a complementary strand,
it has a higher polarization value. When using FP to detect and monitor
odorant-binding which may activate or inhibit the sensory receptors of the invention,
fluorescence-labeled sensants or auto-fluorescent sensants may be used.
Fluorescence polarization (P) is defined as:

$$P = \frac{\mathrm{Int}_{\parallel} - \mathrm{Int}_{\perp}}{\mathrm{Int}_{\parallel} + \mathrm{Int}_{\perp}}$$

where $\mathrm{Int}_{\parallel}$ is the intensity of the emission light parallel to the excitation light
plane and $\mathrm{Int}_{\perp}$ is the intensity of the emission light perpendicular to the excitation
light plane. P, being a ratio of light intensities, is a dimensionless number. For
example, the Beacon® and Beacon 2000™ System may be used in connection with
these assays. Such systems typically express polarization in millipolarization units (1
Polarization Unit = 1000 mP Units).
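
The definition of P and its conversion to millipolarization units can be sketched in a few lines of Python. The function and parameter names below are illustrative assumptions, and the intensity values in the usage example are made-up numbers rather than measured data; instrument-specific corrections are not modeled.

```python
def polarization(int_parallel: float, int_perpendicular: float) -> float:
    """Fluorescence polarization P = (I_par - I_perp) / (I_par + I_perp).

    Both arguments are emission intensities in the same (arbitrary) units,
    measured parallel and perpendicular to the excitation light plane.
    """
    total = int_parallel + int_perpendicular
    if total <= 0:
        raise ValueError("total emission intensity must be positive")
    return (int_parallel - int_perpendicular) / total


def to_millipolarization(p: float) -> float:
    """Convert polarization units to mP (1 polarization unit = 1000 mP)."""
    return 1000.0 * p


if __name__ == "__main__":
    # Hypothetical intensities: (1200 - 800) / (1200 + 800) = 0.2, i.e. 200 mP.
    p = polarization(1200.0, 800.0)
    print(p, to_millipolarization(p))
```

Because P is a ratio, it is unaffected by a common scaling of both intensities, which is one reason the measurement can be repeated on the same sample.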

The relationship between molecular rotation and size is described by the Perrin
equation and the reader is referred to Jolley, J. Anal. Toxicol. 5:236, 1981, which gives
a thorough explanation of this equation. Summarily, the Perrin equation states that
polarization is directly proportional to the rotational relaxation time, the time that it
takes a molecule to rotate through an angle of approximately 68.5°. Rotational
relaxation time is related to viscosity (η), absolute temperature (T), molecular
volume (V), and the gas constant (R) by the following equation:

$$\text{Rotational Relaxation Time} = \frac{3 \eta V}{R T}$$

The rotational relaxation time is small (≈ 1 nanosecond) for small molecules
(e.g., fluorescein) and large (≈ 100 nanoseconds) for large molecules (e.g.,
immunoglobulins). If viscosity and temperature are held constant, rotational
relaxation time, and therefore polarization, is directly related to the molecular volume.
Changes in molecular volume may be due to interactions with other molecules,
dissociation, polymerization, degradation, hybridization, or conformational changes
of the fluorescently labeled molecule. For example, fluorescence polarization has
been used to measure enzymatic cleavage of large fluorescein labeled polymers by
proteases, DNases, and RNases. It also has been used to measure equilibrium binding
for protein/protein interactions, antibody/antigen binding, and protein/DNA binding.
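
The dependence of rotational relaxation time on molecular volume can be made concrete with a short Python sketch of the relation 3ηV/RT. The SI unit choices, the helper that estimates molar volume from molar mass, and the partial specific volume of 0.73 mL/g (a commonly quoted approximate figure for proteins) are assumptions introduced for this example and are not taken from the specification.

```python
GAS_CONSTANT = 8.314462618  # molar gas constant R, in J/(mol*K)


def rotational_relaxation_time(viscosity_pa_s: float,
                               molar_volume_m3_per_mol: float,
                               temperature_k: float) -> float:
    """Rotational relaxation time in seconds, computed as 3*eta*V / (R*T)."""
    if temperature_k <= 0:
        raise ValueError("absolute temperature must be positive")
    return (3.0 * viscosity_pa_s * molar_volume_m3_per_mol
            / (GAS_CONSTANT * temperature_k))


def molar_volume_from_mass(molar_mass_g_per_mol: float,
                           specific_volume_ml_per_g: float = 0.73) -> float:
    """Rough molar volume in m^3/mol (assumed partial specific volume; hydration ignored)."""
    return molar_mass_g_per_mol * specific_volume_ml_per_g * 1e-6


if __name__ == "__main__":
    water_viscosity = 8.9e-4  # Pa*s, approximate value for water near 25 degrees C
    # Hypothetical ~150 kDa immunoglobulin: result is on the order of 100 ns.
    igg = rotational_relaxation_time(water_viscosity,
                                     molar_volume_from_mass(150_000.0), 298.15)
    print(f"{igg * 1e9:.0f} ns")
```

With viscosity and temperature fixed, doubling the molar volume doubles the computed relaxation time, which is the proportionality the binding assays rely on.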

3. Soluble and solid state high throughput assays

In yet another embodiment, the invention provides soluble assays using
molecules such as a domain, e.g., a ligand binding domain, an extracellular domain, a
transmembrane domain (e.g., one comprising seven transmembrane regions and
cytosolic loops), the transmembrane domain and a cytoplasmic domain, an active site,
a subunit association region, etc.; a domain that is covalently linked to a heterologous
protein to create a chimeric molecule; a sensory receptor protein; or a cell or tissue
expressing a sensory receptor protein, either naturally occurring or recombinant. In
another embodiment, the invention provides solid phase based in vitro assays in a high
throughput format, where the domain, chimeric molecule, sensory receptor protein, or
cell or tissue expressing the sensory receptor is attached to a solid phase substrate.

In the high throughput assays of the invention, it is possible to screen up to
several thousand different modulators or ligands in a single day. In particular, each
well of a microtiter plate can be used to run a separate assay against a selected
potential modulator, or, if concentration or incubation time effects are to be observed,
every 5-10 wells can test a single modulator. Thus, a single standard microtiter plate
can assay about 100 (e.g., 96) modulators. If 1536 well plates are used, then a single
plate can easily assay from about 1000 to about 1500 different compounds. It is
possible to assay several different plates per day; assay screens for up to about
6,000-20,000 different compounds are possible using the integrated systems of the
invention. More recently, microfluidic approaches to reagent manipulation have been
developed.
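
The plate arithmetic above can be sketched as follows. This is a simple capacity estimate under stated assumptions: the function names are hypothetical, every well is treated as available for test compounds, and control wells and replicate layouts are not modeled.

```python
def modulators_per_plate(wells_per_plate: int, wells_per_modulator: int = 1) -> int:
    """Number of modulators one plate can test.

    `wells_per_modulator` is 1 for single-point screening, or e.g. 5-10 when
    concentration or incubation time effects are observed for each modulator.
    """
    if wells_per_plate <= 0 or wells_per_modulator <= 0:
        raise ValueError("well counts must be positive")
    return wells_per_plate // wells_per_modulator


def compounds_per_day(wells_per_plate: int, plates_per_day: int,
                      wells_per_modulator: int = 1) -> int:
    """Upper-bound daily throughput for a given plate format and plate count."""
    if plates_per_day < 0:
        raise ValueError("plates_per_day must be non-negative")
    return plates_per_day * modulators_per_plate(wells_per_plate, wells_per_modulator)


if __name__ == "__main__":
    print(modulators_per_plate(96))        # single-point screen on a 96-well plate
    print(modulators_per_plate(96, 10))    # 10-point series per modulator
    print(compounds_per_day(1536, 4))      # four 1536-well plates in a day
```

Reserving some wells for controls would lower these figures, so the values should be read as upper bounds on throughput.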

The molecule of interest can be bound to the solid state component, directly or
indirectly, via covalent or non covalent linkage, e.g., via a tag. The tag can be any of a
variety of components. In general, a molecule which binds the tag (a tag binder) is
fixed to a solid support, and the tagged molecule of interest (e.g., the taste
transduction molecule of interest) is attached to the solid support by interaction of the
tag and the tag binder.

A number of tags and tag binders can be used, based upon known molecular
interactions well described in the literature. For example, where a tag has a natural
binder, for example, biotin, protein A, or protein G, it can be used in conjunction with
appropriate tag binders (avidin, streptavidin, neutravidin, the Fc region of an
immunoglobulin, etc.). Antibodies to molecules with natural binders such as biotin
are also widely available and appropriate tag binders (see, SIGMA Immunochemicals
1998 catalogue SIGMA, St. Louis MO).

Similarly, any haptenic or antigenic compound can be used in combination
with an appropriate antibody to form a tag/tag binder pair. Thousands of specific
antibodies are commercially available and many additional antibodies are described in
the literature. For example, in one common configuration, the tag is a first antibody
and the tag binder is a second antibody which recognizes the first antibody. In
addition to antibody-antigen interactions, receptor-ligand interactions are also
appropriate as tag and tag-binder pairs. For example, agonists and antagonists of cell
membrane receptors (e.g., cell receptor-ligand interactions such as transferrin, c-kit,
viral receptor ligands, cytokine receptors, chemokine receptors, interleukin receptors,
describing arrays of biopolymers fixed to solid substrates). Non-chemical approaches
for fixing tag binders to substrates include other common methods, such as heat,
cross-linking by UV radiation, and the like.

4. Computer-based assays

Yet another assay for compounds that modulate sensory receptor protein
activity involves computer assisted drug design, in which a digital or analog
processing system is used to generate a three-dimensional structure of a sensory
receptor protein based on the structural information encoded by its amino acid
sequence. The input amino acid sequence interacts directly and actively with a
preestablished algorithm in a computer program to yield secondary, tertiary, and
quaternary structural models of the protein. The models of the protein structure are
then examined to identify regions of the structure that have the ability to bind, e.g.,
ligands. These regions are then used to identify ligands that bind to the protein.

The three-dimensional structural model of the protein is generated by entering
protein amino acid sequences of at least 10 amino acid residues or corresponding
nucleic acid sequences encoding a sensory receptor polypeptide into the computer
system. The nucleotide sequence encoding the polypeptide, or the amino acid
sequence thereof, can be any of those described herein, or fragments or variants
thereof.

The amino acid sequence represents the primary sequence or subsequence of 
the protein, which encodes the structural information of the protein. At least 10 
residues of the amino acid sequence (or a nucleotide sequence encoding 10 amino 
acids) are entered into the computer system from computer keyboards, computer 
readable substrates that include, but are not limited to, electronic storage media (e.g., 
magnetic diskettes, tapes, cartridges, and chips), optical media (e.g., CD ROM), 
information distributed by internet sites, and by RAM. The three-dimensional 
structural model of the protein is then generated by the interaction of the amino acid 
sequence and the computer system, using software known to those of skill in the art. 

The amino acid sequence represents a primary structure that encodes the
information necessary to form the secondary, tertiary and quaternary structure of the
protein of interest. The software looks at certain parameters encoded by the primary
sequence to generate the structural model. These parameters are referred to as "energy
terms," and primarily include electrostatic potentials, hydrophobic potentials, solvent
accessible surfaces, and hydrogen bonding. Secondary energy terms include van der
Waals potentials. Biological molecules form the structures that minimize the energy
terms in a cumulative fashion. The computer program is therefore using these terms
encoded by the primary structure or amino acid sequence to create the secondary
structural model.

The tertiary structure of the protein encoded by the secondary structure is then
formed on the basis of the energy terms of the secondary structure. The user at this
point can enter additional variables such as whether the protein is membrane bound
or soluble, its location in the body, and its cellular location, e.g., cytoplasmic, surface,
or nuclear. These variables along with the energy terms of the secondary structure are
used to form the model of the tertiary structure. In modeling the tertiary structure, the
computer program matches hydrophobic faces of secondary structure with like, and
hydrophilic faces of secondary structure with like.

Once the structure has been generated, potential ligand binding regions are
identified by the computer system. Three-dimensional structures for potential ligands
are generated by entering amino acid or nucleotide sequences or chemical formulas of
compounds, as described above. The three-dimensional structure of the potential
ligand is then compared to that of the sensory receptor protein to identify ligands that
bind to the protein. Binding affinity between the protein and ligands is determined
using energy terms to determine which ligands have an enhanced probability of
binding to the protein.

Computer systems are also used to screen for mutations, polymorphic variants,
alleles and interspecies homologs of sensory receptor genes. Such mutations can be
associated with disease states or genetic traits. As described above, GeneChip™ and
related technology can also be used to screen for mutations, polymorphic variants,
alleles and interspecies homologs. Once the variants are identified, diagnostic assays
can be used to identify patients having such mutated genes. Identification of the
mutated sensory receptor genes involves receiving input of a first nucleic acid or
amino acid sequence of a sensory receptor gene, or conservatively modified versions
thereof. The sequence is entered into the computer system as described above. The
first nucleic acid or amino acid sequence is then compared to a second nucleic acid or
amino acid sequence that has substantial identity to the first sequence. The second
sequence is entered into the computer system in the manner described above. Once
odorant dependent activation monitored. Control samples (untreated with activators
or inhibitors) are assigned a relative sensory receptor activity value of 100. Inhibition
of a sensory receptor protein is achieved when the sensory receptor activity value
relative to the control is about 90%, optionally 50%, optionally 25-0%. Activation of
a sensory receptor protein is achieved when the sensory receptor activity value relative
to the control is 110%, optionally 150%, 200-500%, or 1000-2000%.
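
The activity criteria stated above (control set to 100, inhibition at about 90% or less, activation at 110% or more) can be expressed as a small Python sketch. The function names, the "no effect" label for the band between the cutoffs, and the treatment of the cutoffs as inclusive are assumptions made for this example.

```python
def relative_activity(sample_signal: float, control_signal: float) -> float:
    """Sample activity expressed relative to an untreated control assigned 100."""
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * sample_signal / control_signal


def classify_activity(activity_percent: float,
                      inhibition_cutoff: float = 90.0,
                      activation_cutoff: float = 110.0) -> str:
    """Label a relative activity value using the least stringent stated cutoffs.

    Stricter optional cutoffs (e.g., 50 for inhibition or 150 for activation)
    can be passed in place of the defaults.
    """
    if activity_percent < 0:
        raise ValueError("relative activity cannot be negative")
    if activity_percent <= inhibition_cutoff:
        return "inhibition"
    if activity_percent >= activation_cutoff:
        return "activation"
    return "no effect"


if __name__ == "__main__":
    # Hypothetical signals in arbitrary units.
    print(classify_activity(relative_activity(45.0, 100.0)))
    print(classify_activity(relative_activity(150.0, 100.0)))
```

Passing a stricter cutoff, for example `inhibition_cutoff=50.0`, reproduces the optional narrower criteria without changing the logic.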

Changes in ion flux may be assessed by determining changes in polarization
(i.e., electrical potential) of the cell or membrane expressing a sensory receptor
protein. One means to determine changes in cellular polarization is by measuring
changes in current, and thereby measuring changes in polarization, with voltage-clamp
and patch-clamp techniques, e.g., the "cell-attached" mode, the "inside-out" mode,
and the "whole cell" mode (see, e.g., Ackerman et al., New Engl. J. Med., 336:1575,
1997). Whole cell currents are conveniently determined using the standard. Other
known assays include: assays to measure ion flux using radiolabeled or fluorescent
probes such as voltage-sensitive dyes (see, e.g., Vestergarrd-Bogind et al., J.
Membrane Biol., 88:67, 1988; Gonzales & Tsien, Chem. Biol., 4:269, 1997; Daniel et
al., J. Pharmacol. Meth., 25:185, 1991; Holevinsky et al., J. Membrane Biology,
137:59, 1994). Generally, the compounds to be tested are present in the range from 1
pM to 100 mM.

The effects of the test compounds upon the function of the polypeptides can be
measured by examining any of the parameters described above. Any suitable
physiological change that affects GPCR activity can be used to assess the influence of
a test compound on the polypeptides of this invention. When the functional
consequences are determined using intact cells or animals, one can also measure a
variety of effects such as transmitter release, hormone release, transcriptional changes
to both known and uncharacterized genetic markers (e.g., Northern blots), changes in
cell metabolism such as cell growth or pH changes, and changes in intracellular
second messengers such as Ca²⁺, IP3, cGMP, or cAMP.

Preferred assays for G-protein coupled receptors include cells that are loaded
with ion or voltage sensitive dyes to report receptor activity. Assays for determining
activity of such receptors can also use known agonists and antagonists for other
G-protein coupled receptors as negative or positive controls to assess activity of tested
compounds. In assays for identifying modulatory compounds (e.g., agonists,
antagonists), changes in the level of ions in the cytoplasm or membrane voltage will
be monitored using an ion sensitive or membrane voltage fluorescent indicator,
respectively. Among the ion-sensitive indicators and voltage probes that may be
employed are those disclosed in the Molecular Probes 1997 Catalog. For G-protein
coupled receptors, promiscuous G-proteins such as Gα15 and Gα16 can be used in the
assay of choice (Wilkie et al., Proc. Natl. Acad. Sci., 88:10049, 1991). Such
promiscuous G-proteins allow coupling of a wide range of receptors.

Receptor activation typically initiates subsequent intracellular events, e.g.,
increases in second messengers such as IP3, which releases intracellular stores of
calcium ions. Activation of some G-protein coupled receptors stimulates the
formation of inositol triphosphate (IP3) through phospholipase C-mediated hydrolysis
of phosphatidylinositol (Berridge & Irvine, Nature 312:315, 1984). IP3 in turn
stimulates the release of intracellular calcium ion stores. Thus, a change in
cytoplasmic calcium ion levels, or a change in second messenger levels such as IP3,
can be used to assess G-protein coupled receptor function. Cells expressing such
G-protein coupled receptors may exhibit increased cytoplasmic calcium levels as a
result of contribution from both intracellular stores and via activation of ion channels,
in which case it may be desirable although not necessary to conduct such assays in
calcium-free buffer, optionally supplemented with a chelating agent such as EGTA, to
distinguish fluorescence response resulting from calcium release from internal stores.

Other assays can involve determining the activity of receptors which, when
activated, result in a change in the level of intracellular cyclic nucleotides, e.g., cAMP
or cGMP, by activating or inhibiting enzymes such as adenylate cyclase. There are
cyclic nucleotide-gated ion channels, e.g., rod photoreceptor cell channels and
olfactory neuron channels that are permeable to cations upon activation by binding of
cAMP or cGMP (see, e.g., Altenhofen et al., Proc. Natl. Acad. Sci., 88:9868, 1991
and Dhallan et al., Nature 347:184, 1990). In cases where activation of the receptor
results in a decrease in cyclic nucleotide levels, it may be preferable to expose the
cells to agents that increase intracellular cyclic nucleotide levels, e.g., forskolin, prior
to adding a receptor-activating compound to the cells in the assay. Cells for this type
of assay can be made by co-transfection of a host cell with DNA encoding a cyclic
nucleotide-gated ion channel, GPCR phosphatase and DNA encoding a receptor (e.g.,
certain glutamate receptors, muscarinic acetylcholine receptors, dopamine receptors,
serotonin receptors, and the like), which, when activated, causes a change in cyclic
nucleotide levels in the cytoplasm.

In a preferred embodiment, sensory receptor protein activity is measured by
expressing a sensory receptor gene in a heterologous cell with a promiscuous
G-protein that links the receptor to a phospholipase C signal transduction pathway
(see Offermanns & Simon, J. Biol. Chem., 270:15175, 1995). Optionally the cell line
is HEK-293 (which does not naturally express sensory receptor genes) and the
promiscuous G-protein is Gα15 (Offermanns & Simon, supra). Modulation of
olfactory transduction is assayed by measuring changes in intracellular Ca²⁺ levels,
which change in response to modulation of the sensory receptor signal transduction
pathway via administration of a molecule that associates with a sensory receptor
protein. Changes in Ca²⁺ levels are optionally measured using fluorescent Ca²⁺
indicator dyes and fluorometric imaging.

In one embodiment, the changes in intracellular cAMP or cGMP can be
measured using immunoassays. The method described in Offermanns & Simon, J.
Biol. Chem., 270:15175, 1995, may be used to determine the level of cAMP. Also,
the method described in Felley-Bosco et al., Am. J. Resp. Cell and Mol. Biol., 11:159,
1994, may be used to determine the level of cGMP. Further, an assay kit for
measuring cAMP and/or cGMP is described in U.S. Patent No. 4,115,538, herein
incorporated by reference.

In another embodiment, phosphatidyl inositol (PI) hydrolysis can be analyzed
according to U.S. Patent No. 5,436,128, herein incorporated by reference. Briefly, the
assay involves labeling of cells with ³H-myoinositol for 48 or more hrs. The labeled
cells are treated with a test compound for one hour. The treated cells are lysed and
extracted in chloroform-methanol-water, after which the inositol phosphates are
separated by ion exchange chromatography and quantified by scintillation counting.
Fold stimulation is determined by calculating the ratio of cpm in the presence of
agonist, to cpm in the presence of buffer control. Likewise, fold inhibition is
determined by calculating the ratio of cpm in the presence of antagonist, to cpm in the
presence of buffer control (which may or may not contain an agonist).
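
The fold stimulation and fold inhibition ratios described above reduce to a single division, sketched here in Python. The function name is a hypothetical label for this example and the cpm values are made-up numbers, not experimental results; background subtraction and replicate averaging are not modeled.

```python
def fold_change(cpm_treated: float, cpm_buffer_control: float) -> float:
    """Ratio of cpm with agonist (or antagonist) to cpm with buffer control.

    For fold stimulation, `cpm_treated` is measured in the presence of agonist.
    For fold inhibition, it is measured in the presence of antagonist, and the
    buffer control may or may not contain an agonist.
    """
    if cpm_treated < 0:
        raise ValueError("cpm values cannot be negative")
    if cpm_buffer_control <= 0:
        raise ValueError("buffer control cpm must be positive")
    return cpm_treated / cpm_buffer_control


if __name__ == "__main__":
    # Hypothetical scintillation counts.
    print(fold_change(4500.0, 1500.0))  # agonist vs. buffer
    print(fold_change(600.0, 1500.0))   # antagonist vs. agonist-containing buffer
```

A ratio above 1 indicates more inositol phosphate counts than the control, and a ratio below 1 indicates fewer.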

In another embodiment, transcription levels can be measured to assess the
effects of a test compound on signal transduction. A host cell containing a sensory
receptor protein of interest is contacted with a test compound for a sufficient time to
effect any interactions, and then the level of gene expression is measured. The
amount of time to effect such interactions may be empirically determined, such as by
running a time course and measuring the level of transcription as a function of time.
The amount of transcription may be measured by using any method known to those of
skill in the art to be suitable. For example, mRNA expression of the protein of
interest may be detected using northern blots or their polypeptide products may be
identified using immunoassays. Alternatively, transcription based assays using
reporter genes may be used as described in U.S. Patent No. 5,436,128, herein
incorporated by reference. The reporter genes can be, e.g., chloramphenicol
acetyltransferase, luciferase, β-galactosidase and alkaline phosphatase. Furthermore,
the protein of interest can be used as an indirect reporter via attachment to a second
reporter such as green fluorescent protein (see, e.g., Mistili & Spector, Nature
Biotech. 15:961, 1997).

The amount of transcription is then compared to the amount of transcription in
either the same cell in the absence of the test compound, or it may be compared with
the amount of transcription in a substantially identical cell that lacks the sensory
receptor protein of interest. A substantially identical cell may be derived from the
same cells from which the recombinant cell was prepared but which had not been
modified by introduction of heterologous DNA. Any difference in the amount of
transcription indicates that the test compound has in some manner altered the activity
of the sensory receptor protein of interest.

6. Transgenic non-human animals expressing sensory receptors

Non-human animals expressing one or more sensory receptor sequences of the
invention, particularly human olfactory receptor sequences, can also be used for
receptor assays. Such expression can be used to determine whether a test compound
specifically binds to a mammalian olfactory transmembrane receptor polypeptide in
vivo by contacting a non-human animal stably or transiently transfected with a nucleic
acid encoding an olfactory receptor or ligand binding region thereof with a test
compound and determining whether the animal reacts to the test compound by
specifically binding to the receptor polypeptide.

Use of translocation domains in the fusion polypeptides generates a cell
expressing high levels of olfactory receptor. Animals transfected or infected with the
vectors of the invention are particularly useful for assays to identify and characterize
odorants/ligands that can bind to a specific receptor or sets of receptors. Such vector-infected
animals expressing libraries of human olfactory sequences can be used for in vivo
screening of odorants and their effect on, e.g., cell physiology (e.g., on olfactory
neurons), on the CNS (e.g., olfactory bulb activity), or behavior.

Means to infect/express the nucleic acids and vectors, either individually or as
libraries, are well known in the art. A variety of individual cell, organ or whole
animal parameters can be measured by a variety of means. For example, recording of
stimulant-induced waves (bulbar responses) from the main olfactory bulb or accessory
olfactory bulb is a useful tool for measuring quantitative stable olfactory responses.
When electrodes are located on the olfactory bulb surface it is possible to record stable
responses over a period of several days (see, e.g., Kashiwayanagi, Brain Res. Protoc.
1:287, 1997). In this study, electroolfactogram recordings were made with a
four-electrode assembly from the olfactory epithelium overlying the endoturbinate bones
facing the nasal septum. Four electrodes were fixed along the dorsal-to-ventral axis of
one turbinate bone or were placed in corresponding positions on four turbinate bones
and moved together up toward the top of the bone. See also, Scott, J. Neurophysiol.
77:1950, 1997; Scott, J. Neurophysiol. 75:2036, 1996; Ezeh, J. Neurophysiol.
73:2207, 1995. In other systems, fluorescence changes in nasal epithelium can be
measured using the dye di-4-ANEPPS, which is applied on the rat's nasal septum and
medial surface of the turbinates (see, e.g., Youngentob, J. Neurophysiol. 73:387,
1995). Extracellular potassium activity (aK) measurements can also be carried out
in vivo. An increase in aK can be measured in the mucus and the proximal part of the
nasal epithelium (see, e.g., Khayari, Brain Res. 539:1, 1991).

The sensory receptor sequences of the invention can be expressed, for example,
in animal nasal epithelium by delivery with an infecting agent, e.g., an adenovirus
expression vector. Recombinant adenovirus-mediated expression of a recombinant
gene in olfactory epithelium using green fluorescent protein as a marker is described
by, e.g., Touhara, Proc. Natl. Acad. Sci. USA 96:4040, 1999.

The endogenous olfactory receptor genes can remain functional and wild-type
(native) activity can still be present. In other situations, where it is desirable that all
olfactory receptor activity is by the introduced exogenous hybrid receptor, use of a
knockout line is preferred. Methods for the construction of non-human transgenic
animals, particularly transgenic mice, and the selection and preparation of
recombinant constructs for generating transformed cells are well known in the art.

Construction of a "knockout" cell and animal is based on the premise that the
level of expression of a particular gene in a mammalian cell can be decreased or
completely abrogated by introducing into the genome a new DNA sequence that
serves to interrupt some portion of the DNA sequence of the gene to be suppressed.
Also, "gene trap insertion" can be used to disrupt a host gene, and mouse embryonic
stem (ES) cells can be used to produce knockout transgenic animals (see, e.g.,
Holzschu, Transgenic Res. 6:97, 1997). The insertion of the exogenous sequence is
typically by homologous recombination between complementary nucleic acid sequences.
The exogenous sequence is some portion of the target gene to be modified, such as exonic,
intronic or transcriptional regulatory sequences, or any genomic sequence which is
able to affect the level of the target gene's expression; or a combination thereof. Gene
targeting via homologous recombination in pluripotential embryonic stem (ES) cells
allows one to modify precisely the genomic sequence of interest. Any technique can
be used to create, screen for, and propagate a knockout animal; see, e.g., Bijvoet, Hum.
Mol. Genet. 7:53, 1998; Moreadith, J. Mol. Med. 75:208, 1997; Tojo, Cytotechnology
19:161, 1995; Mudgett, Methods Mol. Biol. 48:167, 1995; Longo, Transgenic Res.
6:321, 1997; U.S. Patent Nos. 5,616,491; 5,464,764; 5,631,153; 5,487,992;
5,627,059; 5,272,071; WO 91/09955; WO 93/09222; WO 96/29411; WO 95/31560;
WO 91/12650.

The nucleic acid libraries can also be used as reagents to produce "knockout"
human cells and their progeny.

F. Modulators

The compounds tested as modulators of a sensory receptor family member can
be any small chemical compound, or a biological entity, such as a protein, sugar,
nucleic acid or lipid. Alternatively, modulators can be genetically altered versions of
a sensory receptor gene. Typically, test compounds will be small chemical molecules
and peptides. Essentially any chemical compound can be used as a potential
modulator or ligand in the assays of the invention, although most often
compounds that can be dissolved in aqueous or organic (especially DMSO-based)
solutions are used. The assays are designed to screen large chemical libraries by
automating the assay steps and providing compounds from any convenient source to
assays, which are typically run in parallel (e.g., in microtiter formats on microtiter
plates in robotic assays). It will be appreciated that there are many suppliers of
chemical compounds, including Sigma (St. Louis, MO), Aldrich (St. Louis, MO),
Sigma-Aldrich (St. Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs,
Switzerland) and the like.
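
As a non-limiting sketch of how compounds might be laid out for such parallel assays, the following assigns each compound of a library to a well of a microtiter plate in row-major order. The 8 x 12 (96-well) format, the function name, and the compound identifiers are assumptions made for illustration; the specification does not prescribe a plate format.

```python
from string import ascii_uppercase


def assign_wells(compounds, rows: int = 8, cols: int = 12):
    """Map compounds to (plate_index, well_name, compound) in row-major order.

    Defaults assume a 96-well plate (rows A-H, columns 1-12); this is an
    illustrative choice, not a requirement of the assays described.
    """
    if not (1 <= rows <= len(ascii_uppercase)) or cols < 1:
        raise ValueError("unsupported plate geometry")
    per_plate = rows * cols
    layout = []
    for index, compound in enumerate(compounds):
        plate, position = divmod(index, per_plate)   # which plate, slot on it
        row, col = divmod(position, cols)            # row letter, column number
        layout.append((plate, f"{ascii_uppercase[row]}{col + 1}", compound))
    return layout
```

A library larger than one plate simply rolls over to the next plate index, which is how a large chemical library would be distributed across many plates in an automated run.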

In one preferred embodiment, high throughput screening methods involve
providing a combinatorial chemical or peptide library containing a large number of
potential therapeutic compounds (potential modulator or ligand compounds). Such
"combinatorial chemical libraries" or "ligand libraries" are then screened in one or
more assays, as described herein, to identify those library members (particular
chemical species or subclasses) that display a desired characteristic activity. The
compounds thus identified can serve as conventional "lead compounds" or can
themselves be used as potential or actual therapeutics.
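
For illustration, identifying library members that display a desired activity can be sketched as flagging compounds whose assay signal departs from untreated control wells. The three-standard-deviation cutoff and all names below are assumptions for this sketch; the specification does not fix a hit-selection criterion.

```python
from statistics import mean, stdev


def call_hits(signals: dict, control_signals: list, n_sd: float = 3.0) -> list:
    """Return the names of compounds whose signal differs from the control
    mean by more than n_sd control standard deviations (hypothetical rule)."""
    if len(control_signals) < 2:
        raise ValueError("need at least two control wells")
    mu = mean(control_signals)
    sd = stdev(control_signals)
    return sorted(name for name, value in signals.items()
                  if abs(value - mu) > n_sd * sd)
```

Both increases and decreases are flagged, so the same rule would pick up candidate activators and candidate inhibitors for follow-up as lead compounds.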

A combinatorial chemical library is a collection of diverse chemical
compounds generated by either chemical synthesis or biological synthesis, by
combining a number of chemical "building blocks" such as reagents. For example, a
linear combinatorial chemical library such as a polypeptide library is formed by
combining a set of chemical building blocks (amino acids) in every possible way for a
given compound length (i.e., the number of amino acids in a polypeptide compound).
Millions of chemical compounds can be synthesized through such combinatorial
mixing of chemical building blocks.
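
The size of such a linear library follows directly from the number of building blocks and the compound length. As a minimal sketch, assuming the 20 standard amino acids as the building-block set (the function names are introduced here for illustration), a library of pentapeptides already contains 20^5 = 3,200,000 members:

```python
from itertools import product

# One-letter codes of the 20 standard amino acids (assumed building-block set).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(n_blocks: int, length: int) -> int:
    """Number of linear compounds of the given length: n_blocks ** length."""
    return n_blocks ** length


def enumerate_library(blocks: str, length: int):
    """Yield every linear combination of the building blocks, lazily."""
    for combo in product(blocks, repeat=length):
        yield "".join(combo)
```

For example, `library_size(len(AMINO_ACIDS), 5)` evaluates to 3200000, and `enumerate_library("AG", 3)` yields the eight tripeptides built from alanine and glycine. The generator form avoids holding a multi-million-member library in memory at once.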

Preparation and screening of combinatorial chemical libraries is well known to
those of skill in the art. Such combinatorial chemical libraries include, but are not
limited to, peptide libraries (see, e.g., U.S. Patent No. 5,010,175; Furka, Int. J. Pept.
Prot. Res. 37:487, 1991; and Houghton et al., Nature 354:84, 1991). Other
chemistries for generating chemical diversity libraries can also be used. Such
chemistries include, but are not limited to: peptoids (e.g., WO 91/19735), encoded
peptides (e.g., WO 93/20242), random bio-oligomers (e.g., WO 92/00091),
benzodiazepines (e.g., U.S. Patent No. 5,288,514), diversomers such as hydantoins,
benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. 90:6909, 1993),
vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc. 114:6568, 1992),
nonpeptidal peptidomimetics with glucose scaffolding (Hirschmann et al., J. Amer.

Moreover, a method is provided for simulating a fragrance or flavor sensed by
a mammal (e.g., human), comprising: for each of a plurality of sensory receptors, or
fragments or variants thereof, ascertaining the extent to which the sensory receptor
interacts with the fragrance and/or flavor; and combining a plurality of compounds,
each having a previously-determined interaction with one or more of the sensory
receptors, in amounts that together provide a stimulation profile that mimics the
profile for the fragrance and/or flavor. Interaction of a fragrance and/or flavor with a
sensory receptor can be determined using any of the binding or reporter assays
described herein. The interactions can be aggregated or a profile generated using
known signal processing techniques (e.g., a neural network) as described below. The
sensory receptor, or fragments or variants thereof (e.g., fusion proteins with reporters,
chimeric proteins) may be expressed in cells; otherwise, ligand-binding domain(s)
may be fixed to a substrate (e.g., planar, bead, or fiber) that is solid or porous. The
plurality of compounds may then be combined to form a mixture. If desired, one or
more of the plurality of the compounds can be combined covalently. The combined
compounds substantially stimulate at least 50%, 60%, 70%, 75%, 80%, 90%, 95%,
99% or all of the receptors that are substantially stimulated by the fragrance or flavor.
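
The percentage recited above can be illustrated with a short sketch that computes what fraction of the receptors stimulated by a target fragrance or flavor is also stimulated by a mixture. The additive model for combining compounds, the threshold defining "substantially stimulated", and the receptor and function names are all assumptions made for this example.

```python
def mixture_profile(compound_profiles: dict, amounts: dict) -> dict:
    """Combine per-compound receptor responses into a mixture profile.

    Assumes responses add linearly with amount, a simplification that is
    not stated in the specification.
    """
    mix = {}
    for name, amount in amounts.items():
        for receptor, response in compound_profiles[name].items():
            mix[receptor] = mix.get(receptor, 0.0) + amount * response
    return mix


def stimulated(profile: dict, threshold: float) -> set:
    """Receptors whose response meets the (hypothetical) threshold."""
    return {r for r, value in profile.items() if value >= threshold}


def coverage(target_profile: dict, mix: dict, threshold: float) -> float:
    """Fraction of target-stimulated receptors also stimulated by the mixture."""
    target_set = stimulated(target_profile, threshold)
    if not target_set:
        return 1.0
    return len(target_set & stimulated(mix, threshold)) / len(target_set)
```

A coverage of 0.75, for instance, would correspond to a mixture that stimulates 75% of the receptors stimulated by the fragrance or flavor being simulated.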

In yet another aspect of the invention, a method is provided in which a
plurality of standard compounds are tested against a plurality of sensory receptors, or
fragments or variants thereof, to ascertain the extent to which the sensory receptors
each interact with each standard compound, thereby generating a receptor stimulation
profile for each standard compound. These receptor stimulation profiles may then be
stored in a relational database on a data storage medium. The method may further
comprise providing a desired receptor-stimulation profile for an odor and/or taste;
comparing the desired receptor-stimulation profile to the relational database; and
ascertaining one or more combinations of standard compounds that most closely
match the desired receptor-stimulation profile. The method may further comprise
combining standard compounds in one or more of the ascertained combinations to
simulate the odor and/or taste.
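
One possible arrangement of the relational database and the comparing step is sketched below using an in-memory SQLite database. The table layout, the additive combination of equal amounts of each compound, the exhaustive search over small combinations, and the Euclidean distance used as the closeness measure are assumptions chosen for illustration and are not prescribed by the specification.

```python
import sqlite3
from itertools import combinations
from math import dist


def build_db(profiles: dict) -> sqlite3.Connection:
    """Store receptor stimulation profiles as (compound, receptor, response)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE profile (compound TEXT, receptor TEXT, response REAL, "
        "PRIMARY KEY (compound, receptor))")
    conn.executemany(
        "INSERT INTO profile VALUES (?, ?, ?)",
        [(c, r, v) for c, p in profiles.items() for r, v in p.items()])
    conn.commit()
    return conn


def load_profiles(conn: sqlite3.Connection) -> dict:
    """Read all stored profiles back into {compound: {receptor: response}}."""
    out = {}
    for compound, receptor, response in conn.execute(
            "SELECT compound, receptor, response FROM profile"):
        out.setdefault(compound, {})[receptor] = response
    return out


def best_combinations(conn, desired: dict, max_size: int = 2, top: int = 1):
    """Rank combinations of up to max_size standard compounds by the distance
    between their summed profile and the desired profile (smaller is closer)."""
    profiles = load_profiles(conn)
    receptors = sorted(set(desired) | {r for p in profiles.values() for r in p})
    target = [desired.get(r, 0.0) for r in receptors]
    scored = []
    for k in range(1, max_size + 1):
        for combo in combinations(sorted(profiles), k):
            summed = [sum(profiles[c].get(r, 0.0) for c in combo)
                      for r in receptors]
            scored.append((dist(summed, target), combo))
    scored.sort()
    return scored[:top]
```

Receptors that a candidate stimulates but the desired profile does not are included in the distance, so a combination is penalized for off-target stimulation as well as for missing the target receptors.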

A further aspect of the invention is to provide a method for representing
sensory perception of a particular odor and/or taste in a mammal (e.g., human),
comprising: providing values X1 to Xn representative of the quantitative stimulation of
each of n sensory receptors of the mammal, where n is greater than or equal to 5, 10,
20, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, or 350; and generating
from the values a quantitative representation of sensory perception. The sensory
receptors may be a receptor disclosed herein, or fragments or variants thereof. The
representation may constitute a point or a volume in n-dimensional space, may
constitute a graph or a spectrum, or may constitute a matrix of quantitative
representations. Also, the providing step may comprise contacting a plurality of
recombinantly-produced sensory receptors, or fragments or variants thereof, with a
composition and quantitatively measuring the interaction of the composition with the
receptors. The maximum number of taste receptors that are needed to mimic the native
repertoire (e.g., about 50) may be less than the maximum number of olfactory receptors
that are needed (e.g., about 350). But the number of sensory receptors that need to be
represented in an assay to provide useful results may be much less.

It is yet another aspect of the invention to provide a method for predicting the
sensory perception in a mammal (e.g., human) generated by one or more molecules or
combinations of molecules yielding unknown olfactory perception in the mammal,
comprising: providing values X1 to Xn representative of the quantitative stimulation of
each of n sensory receptors of the mammal, where n is greater than or equal to 5, 10,
20, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, or 350, for one or more
molecules or combinations of molecules yielding known sensory perception in a
mammal; generating from the values a quantitative representation of sensory
perception in a mammal for the one or more molecules or combinations of molecules
yielding known sensory perception in a mammal; providing values X1 to Xn
representative of the quantitative stimulation of each of n sensory receptors of the
mammal, where n is greater than or equal to 5, 10, 20, 50, 75, 100, 125, 150, 175, 200,
225, 250, 275, 300, 325, or 350, for one or more molecules or combinations of
molecules yielding unknown sensory perception in a mammal; generating from the
values a quantitative representation of sensory perception in a mammal for the one or
more molecules or combinations of molecules yielding unknown sensory perception in
a mammal; and predicting the olfactory perception in a mammal generated by the one
or more molecules or combinations of molecules yielding unknown sensory perception
in a mammal by comparing the quantitative representation of sensory perception in a
mammal for the one or more molecules or combinations of molecules yielding unknown
sensory perception in a mammal to the quantitative representation of sensory
perception in a mammal for the one or more molecules or combinations of molecules
yielding known sensory perception in a mammal. The sensory receptors used in this
method may include a receptor disclosed herein, or fragment or variant thereof. The
maximum number of taste receptors that are needed to mimic the native repertoire
(e.g., about 50) may be less than the maximum number of olfactory receptors that are
needed (e.g., about 350). But the number of sensory receptors that need to be
represented in an assay to provide useful results may be much less.
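
The comparing step may be illustrated, without limitation, by a nearest-neighbor rule: the unknown representation is assigned the perception of the closest known representation. Nearest-neighbor matching by Euclidean distance is an assumption of this sketch; other techniques such as a neural network could equally be used. The perception labels in the usage note are hypothetical.

```python
from math import dist


def predict_perception(unknown_vector, known):
    """Return (label, distance) of the known representation nearest to the
    unknown one.

    `known` is a list of (label, vector) pairs; every vector must have the
    same length n and the same receptor order as `unknown_vector`.
    """
    if not known:
        raise ValueError("at least one known representation is required")
    best_label, best_vector = min(
        known, key=lambda item: dist(item[1], unknown_vector))
    return best_label, dist(best_vector, unknown_vector)
```

For instance, with known representations labeled "floral" and "minty", an unknown molecule whose vector lies closest to the "floral" vector would be predicted to be perceived as floral; the returned distance gives a rough indication of how close the match is.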

Assaying for ligand-receptor binding in a large set of sensory receptors is
envisioned to mimic the function of the nose or tongue in sensory perception of
chemical sensants. For a set of sensory receptors with a redundancy of ligand-binding
domains, the resultant signal is a summation of the interactions of all sensory
receptors in the set: none, one, or a few of which have ligand-binding domains that are
a perfect fit for the sensant; some of which have ligand-binding domains that are only
an imperfect fit for the sensant; and most of which do not significantly bind to the
sensant. The individual sensory receptor's binding affinity for sensant varies
accordingly. Thus, a suitably large set of ligand-binding domains from sensory receptors
will be analogous to any ability of the native repertoire of olfactory or taste receptors
to tolerate varying degrees of mismatch between ligand and ligand-binding domain.
The complication of desensitization of sensory receptor signaling will also be
avoided by using the invention instead of preparations of nasal or tongue epithelium
containing native sensory receptors. For example, the event of binding between
ligand and receptor can be separated from more complicated downstream signaling
events in the epithelial cells. But the latter events can be reconstituted in the invention
by transferring the sensory receptor of interest into appropriate host cells containing
an intact and functional signaling pathway.
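
The summation described above can be made concrete under a simple single-site binding model, in which the fractional occupancy of a receptor with dissociation constant Kd at ligand concentration [L] is [L] / ([L] + Kd). This model and the names below are assumptions adopted for illustration; the specification does not commit to a particular binding equation.

```python
def occupancy(ligand_conc: float, kd: float) -> float:
    """Fractional occupancy under single-site binding: [L] / ([L] + Kd)."""
    if ligand_conc < 0 or kd <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return ligand_conc / (ligand_conc + kd)


def summed_signal(ligand_conc: float, kds: list) -> float:
    """Sum of occupancies over a receptor set, one Kd per receptor.

    A low Kd models a good fit for the sensant; a high Kd models a receptor
    that does not significantly bind it.
    """
    return sum(occupancy(ligand_conc, kd) for kd in kds)
```

With one well-fitting receptor and many poorly fitting ones, the sum is dominated by the well-fitting receptor, while the imperfect fits contribute small but nonzero terms, consistent with the tolerance for mismatch described in the text.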

H. Administration of Novel Sensant Compositions 

Sensory modulators can be administered directly to a mammal (e.g., human)
for modulation of sensory perception in vivo. Administration is by any of the routes
normally used for introducing a modulator compound into ultimate contact with the
tissue to be treated (e.g., nose or tongue). The olfactory modulators are administered
in any suitable manner, optionally with acceptable carriers. Suitable methods of
administering such modulators are available and well known to those of skill in the
art, and, although more than one route can be used to administer a particular
composition, a particular route can often provide a more immediate and more
effective reaction than another route. Acceptable carriers are determined at least in
part by the particular components of the composition to be administered (e.g.,
stabilizing the sensants), as well as by the particular method used to administer the
composition. Accordingly, there is a wide variety of suitable formulations of
pharmaceutical compositions of the present invention (see, e.g., Remington's
Pharmaceutical Sciences, 17th ed. 1985).

The sensory modulators, alone or in combination with other suitable
components, can be made into aerosol formulations (i.e., they can be "nebulized") to
be administered via inhalation. Aerosol formulations can be placed into pressurized
acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the
like, that may or may not contribute to sensory perception. Other possible formulations
include dry or liquid forms, powders or tablets, solutions of polar (e.g., water) or
nonpolar (e.g., alcohol) solvents, emulsions or suspensions, creams, gels, lotions, and
syrups.

Formulations suitable for administration include aqueous and non-aqueous
solutions, isotonic sterile solutions, which can contain antioxidants, buffers,
bacteriostats, and solutes that render the formulation isotonic, and aqueous and
non-aqueous sterile suspensions that can include suspending agents, solubilizers,
thickening agents, stabilizers, and preservatives. In the practice of this invention,
compositions can be administered, for example, orally, topically, intravenously,
intraperitoneally, intravesically, or intrathecally. Optionally, the compositions are
administered orally or nasally. The formulations of compounds can be presented in
unit-dose or multi-dose sealed containers, such as ampules and vials. Solutions and
suspensions can be prepared from sterile powders, granules, and tablets of the kind
previously described. The modulators can also be administered as part of a prepared
drug, food, or cosmetic. In particular, an unpleasant odor or taste (e.g., sulfur or
bitter, respectively) may not be perceived as such and/or its effects reduced by
blocking the binding between sensant ligand and sensory receptor by adding a
competitor ligand that blocks binding between cognate ligand and receptor, or
inhibiting or reducing signal transduction. In contrast, a pleasant odor or taste can be
mimicked or enhanced. Primary sensants are preferred because the subset of activated
cells is kept small and the effects limited to projection into a specific region of the
brain. But novel olfactants or combinations thereof that bind only a few olfactory
receptors (e.g., having fewer than five different ligand-binding domains) would also be
useful.

The dose administered to a mammal (e.g., human) should be sufficient to
effect a beneficial response in the subject over time. The dose will be determined by
the efficacy of the particular sensory modulators employed and the condition of the
subject, as well as the body weight or surface area of the area to be treated. The size
of the dose also will be determined by the existence, nature, and extent of any adverse
side-effects that accompany the administration of a particular compound or vector in a
particular subject. In determining the effective amount of the modulator to be
administered, a physician may evaluate circulating plasma levels of the sensory
modulator, modulator toxicities, and the production of anti-modulator antibodies. In
general, the dose equivalent of a modulator is from about 1 ng/kg to 10 mg/kg for the
typical mammal. For administration, sensory modulators can be administered at a
rate determined by the ED50 of the modulator, and the side-effects of the inhibitor at
various concentrations, as applied to the mass and overall health of the mammal.
Administration can be accomplished via single or divided doses.

I. Kits

Sensory receptor genes, or fragments or variants thereof, are useful tools for
identifying cells expressing sensory receptors, for forensics and paternity
determinations, and for examining signal transduction in isolated cells. Sensory
receptor family member-specific reagents that specifically hybridize to sensory
receptor nucleic acids, such as AOLFR1 probes and primers, and sensory receptor
specific reagents that specifically bind to a sensory receptor protein, e.g., anti-sensory
receptor antibodies, are used to examine expression in cells and regulation of signal
transduction. For example, one or more family member-specific reagents may be
used to detect polymorphisms that are linked to genetic anosmia or to detect allelic
exclusion.

Nucleic acid assays for the presence of DNA and RNA for a sensory receptor
family member in a sample include numerous techniques that are known to those
skilled in the art, such as Southern analysis, Northern analysis, dot blots, RNase
protection, S1 analysis, amplification techniques such as PCR, and in situ
hybridization. In in situ hybridization, for example, the target nucleic acid is liberated
from its cellular surroundings in such a way as to be available for hybridization within
the cell while preserving the cellular morphology for subsequent interpretation and
analysis. The following articles provide an overview of the art of in situ hybridization:
Singer et al., Biotechniques 4:230-250 (1986); Haase et al., Methods in Virology,
vol. VII, pp. 189-226 (1984); Nucleic Acid Hybridization: A Practical Approach
(Hames et al., eds. 1987). In addition, a sensory receptor protein can be detected with
the various immunoassay techniques described above. The test sample is typically
compared to both a positive control (e.g., a sample expressing a recombinant sensory
receptor protein) and a negative control.
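
By way of example only, comparison of a test sample to both controls can be expressed as a normalized value in which the negative control maps to 0% and the positive control to 100%. The function name and the linear scaling are assumptions of this sketch and are not taken from the specification.

```python
def percent_of_positive(sample: float, positive: float, negative: float) -> float:
    """Scale a sample signal so that the negative control reads 0 and the
    positive control reads 100 (linear interpolation between the controls)."""
    span = positive - negative
    if span <= 0:
        raise ValueError("positive control must exceed negative control")
    return 100.0 * (sample - negative) / span
```

A value near 0 suggests the sample resembles the negative control, while a value near 100 suggests expression comparable to the recombinant positive control.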

The present invention also provides for kits for screening for novel modulators
of sensory receptor family members. Such kits can be prepared from readily available
materials and reagents, as well as any of the aforementioned products. For example,
such kits can comprise any one or more of the following materials: sensory receptor
nucleic acids or proteins, reaction tubes, and instructions for testing sensory receptor
activity. Optionally, the kit contains a biologically active sensory receptor. A wide
variety of kits and components can be prepared according to the present invention,
depending upon the intended user of the kit and the particular needs of the user.

Examples

NO: 1) 

CCATCACTGAATTCATTCTCCTGGQAT^CAAG^^ GGCTCATCATTGTGG crA 
GTGCTTnCTTGGGTATGWrcTGG^ 






CCATCACTGAATTCATT^ 

XTTCATACCCC ' " " 
GATATTTCCTCCATTTCCAACTCAGT^ 

AGCTTCTTTTCCTATGTCTG^^ 

AGTGGAAAGCCTTCTCCAOTGTG^^^ 

CATTGTAGGCGTGTACI^CCC^^^ 

25 £S£S^^ 

ID NO: 2) 



3) 

35 AT^l^TOOniTAAOaj^ 

ATTACCCAAAGCHTCAGATTCCTCTCTT^^ 
GTAGGAAACCITGGGATGATCATAATAATC 

ACTTTTTCCTTAGTCACCTCTC^ 

40 TGCTTGAGAACTrGGTAATGGCAG^ 

CTTCCrGTCCTGCACTGCTGTC^^ 
CGCTTTGTGGCCATCTGCAATCCTCTGCTriAl/^ 

CCCTGCTGGTGGCTGGGTCAT^ 
GCTCTCCGGTTAAACTTCTCTCG^ 
45 TCTCATCTCTGTGTCTGGCTCTCATATA^^ 

ccttcaatgagatgtg™^ 

GTACTAAAAATCCGTTCTGTTAGTGGGCGCCAC^^ 
TGACTGCTATCACCATCTKX^^ 

TACACAAGTTCCATTTCACTGA (SEQ ID NO: 4) 



S^SSSSSo—™—
NQWLOTI^TT^EISTLLA^TSYAFIVVTIIJCMR5VSGRRKAFSTCASHLTAI77FHGTILFLY^
. PNSKJvISRHTVKVASVFYTVVTPMLWLIYSLRNKDVKDTVTEILDTKVFSY (SEQ ID NO: 5) 

ATGCTGCTGACAGATAGAAATACAAGTGGGACCACGTTCACCCTCITGGGCTTCTCAGATT 
ACCCAGAACTGCAAGTCCCACTCTTCCTGGTTTTTCTGGCCATCTACAATGTCACTGTGCTA
GGGAATATTGGGTTGATTGTGATCATCAAAATCAACCCCAAACTGCATACCCCCATGTACT 
TTTrCCTCAGCCAACTCTCCTTTGTGGATTTCTGCTATTCCTCCATCATTGCTCCCAAGATG 
TTGGTGAACCTT GTTGT CAAAGACAGAACCATTTCATTTTTAGGATGCGTAGTACAATTCT 
TTTTCTTCTGTACCTTTGTGGTCACTGAATCCTTTTTATTAGCTGTGATGGCCTATGACCGC 

TTCGTGGCCATTTGCAACCCTCTGCTCTACACAGTTGACATGTCCCAGAAACTCTGCGTGC
TGCTGGTTGTGGGATCCTATGCCTGGGGAGTCTCATGTTCCTTGGAACTGACGTGCTCTGC 
TTTAAAGTTATGTTTTCATGGTTTCAACACAATCAATCACITCTTCTGTGAGTTCTCCTCAC 
TACTCTCCCTTTCTTGCTCTGATACTTACATCAACCAGTGGCTGCTATTCTTTCTTGCCACC 
TTTAATGAAATCAGCACACTACTCATCGTTCTCACATCTTATGCGTTCATTGTTGTAACCAT 

CCTCAAGATGCGTTCAGTCAGTGGGCGCCGCAAAGCCTTCTCCACCTGTGCCTCCCACCTG

ACTGCCATCACCATCTTCCATGGCACCATCCTCTrCCITTACTGTGTGCCCAACTCCAAAAA 

CTCCAGGCACACAGTCAAAGTGGCCTCTGTGTTTTACACCGTGGTGATCCCCATGTTGAAT 

CCCCTGATCTACAGTCTGAGAAATAAAGATGTCAAGGATACAGTCACCGAGATACTGGAC 
ACCAAAGTCTTCTCTTACTGA (SEQ ID NO: 6) 


AOLFR4 sequences: 

MENQNNVTEFILLGLTENLELWKIFSAVFLVMYVATVLENLLIVVTIITSQSLRSPMYFFLTFLS 

LLDVMFSSVVAPKVIVDTLSKSTTISlJCGCLTQLFVEHFFGGVGIILLT\nvlAYDRYVAICKPLHY 

TIIMSPRVCCIJVtfVGGAWVGGFMHAMIQLLFMYQIPFCGPNIIDOT 

25 L\HXNSGMMCVAIFLILIASYTVILCSLKSYSSKGRHKALSTCSSHLTVVVLFFVPCIFLYMRPV 
VTHPIDKAMAVSDSIITPMLNPLIYTLRNAEYKSAMKKLWMKWEALAGK (SEQ ID NO: 7) 

ATGGAAAATCA AAAC AATGTGACTGAATTCATTCTTCTGGGTCTCACAGAGAACCTGGAGC 
TGTGGAAAATATTTTCTGCTGTGTTTCTTGTCATGTATGTAGCCACAGTGCTGGAAAATCT 

30 ACTTATTGTGGTAACTATTATCACAAGTCAGAGTCTGAGGTCACCTATGTA 1 1 1 1 1 I T CTTA 
CCTTCTTGTCCCTITTGGATGTCATGTTCrCATCTGTCGTTGCCCCCAAGGTGATTGTAGAC 
ACCCTCTCCAAGAGCACTACCATCTCTCTCAAAGGCTGCCTCACCCAGCTGTTTGTGGAGC 
ATTTCTTTGGTGGTGTGGGGATCATCCTCCTCACTGTGATGGCCTATGACCGCTACGTGGC 
CATCTGTAAGCCCCTGCACTACACGATCATCATGAGTCCACGGGTGTGCTGCCTAATGGTA 

GGAGGGGCTTGGGTGGGGGGATTTATGCACGCAATGATACAACTTCTCTTCATGTATCAAA
TACCCTTCTGTGGTCCTrAATATCATAGATCACTTTATATGTGATTTGTTTCAGTTGTTGACA 
CTTGCCTGCACGGACACCCACATCCTGGGCCTCTTAGTTACCCTCAACAGTGGGATGATGT 
GTGTGGCCATCTTTCTrATCTTAATTGCGTCCTACACGGTCATCCTATGCTCCCTGAAGTCT 
TACAGCTCTAAAGGGCGGCACAAAGCCCTCTCTACCTGCAGCTCCCACCTCACGGTGGTTG 

TATTGTTCTTTGTCCCCTGTATTTTCTTGTACATGAGGCCTGTGGTCACTCACCCCATAGAC

AAGGCAATGGCTGTGTCAGACTCAATCATCACACCCATGTTAAATCCCTTGATCTATACAC 

TGAGGAATGCAGAGGTGAAAAGTGCCATGAAGAAACTCTGGATGAAATGGGAGGCTTTGG 
CTGGGAAATAA (SEQ ID NO: 8) 



AOLFR5 sequences:

MGKENCTTVAEFnXGLSDVPELRVCLFLLFLLrYGVTLLAN^ 
LSSVDFCYSSIIVPKMLANIFNKDKAISFLGCMVQFYLFCTCVVTEWLLAV 
LYTVTMSWKWVEI^SCCYFCGTVCSLIHLCLALRIPFYRSNVINHFFCDLPPVLSIJVCSDITVN 
ETLLFLVATLNESVTIMnLTSYLLILTmKMGSAEGRHKAFSTCASHLTAITVFHGTVLSIYCRP 
SSGNSGDADKVATVFyTVVIPMLNSVIYSLRNKDVKEAlJRKVMGSKIHS (SEQ ID NO: 9)



ATGGGCAAGGAAAACTGCACCACTGTGGCTGAGTTCATTCTCCTTGGACTATCAGATGTCC 
CTGAGTTGAGAGTCTGCCTCTTCCTGCTGTTCCTTCTCATCTATGGAGTCACGTTGTTAGCC 
AACCTGGGCATGATTGCACTGATTCAGGTCAGCTCTCGGCTCCACACCCCCATGTACTTTT 
TCCrCAGCCACTTGTCCTCTGTAGATTTCTGCTACTCCTCAATAATTGTGCCAAAAATGTTG
GCTAATATCTTTAACAAGGACAAAGCCATCTCCTTCCTAGGGTGCATGGTGCAATTCTACT
ACCGTGTTCTACACAGTCGTGATTCCTATGCTGAACTC
,0 TOTOA^ACAGCCraTG^^GSAAAOAACCrCTCAOAAAAGTOATGOOCrC 



GTGGAGATGCTGACAAAGTGGCCAf 
TGTGATCTACAGCCTGAGAAATAAA 
CAAAATTCACTCCTAG (SEQ ID NO: 10) 



TCTTCT 
CTTCTT 

AATTGT i i i i ui^ i — - 



30 tattcaatgaggtgagcagcctaa™^ 

ATTATGAAGATGCGATCTGCAAGTGGGCG^ 
CACCAAATTGATTTACCACTGA (SEQ ID NO: 12)



AOLFR7 sequences: 
MSYFYRLKLMKEAVLVKLP 

aiflimyllaavgnvliipaiysdp: 



PFIYSLRNKDMKRGLKKLQDRIYR (SEQ ID NO: 13)

CCCCTTCCTGTGTATCATCTTCTCCTACCTGCGAATCATGGTCACTGTGCTCAGAATCCCCT
CTGCAGCCGGGAAGTGGAAGGCCTTCTCTACCTGTGGCTCCCACCTCACTGCAGTAGCCCT 

ACCGGGTAGCCACAGTTATGTACACAGTAGTGACACCCATGCTGAACCCTTTCATCTACAG 
GC J GAG ^ A ^ CA ^ GATATGAAGAGGGG ^ GAAGA ^ 

AOLFR8 sequences: 

MATSNHSSGAEFII^GLTQRPELQLPLFLIJLGIYVVTWG>n-GMIFLIALSSOLYPPVYYFI <5H 
10 LSFIDLCYSSVITPKMLVNFVPEENIISFLECITQLYFFLIFVMEGYLLTAMEYDRYVAIC^LI.Y 
OTVMSHRVCSIMMAVVYSLGFLWATVHTTRMS\n^ 

I^IICGVKIlATriJVVLISYAlTFSSILGmSTEGQSKAFGTCSSHIXAV^ 
TTMEKEKVSSVFYITIIPMLWLIYSLRNKDVKNALKKMTRGRQSS (SEQ ID NO: 15) 

i 5 ATGGCTACTTCAAACCATTCTTCAGGGGCTGAGTTTATCCTGGCAGGCTTGACACA Arcrr 
CAGAACTTCAACTGCCACrcrTCCTCCTGTC^ 

GAACCTGGGCATGATCTTCTTAATTGCTCTCAGrrCTCAACTITACCCTCCA^ 

GTGAACTTTGTTCCAGAGGAGAACATTATCTCCTTTCTGGAATGCATTACTCAACTTTATTT 
CTTCCrTATTTlTGTAATTGCAGAAGGCTACCTTCTGACAGCCATGGAATATGACCGTTAT 
OTGCTATCTGTCGCCCACTGCTITACAATATTGTCATGTCCCACAGGGT^ 

? A S^ TGGTATACTCACTGGG ™ CTGTGGG ^ 

GTGTTGTCATTCTGTAGGTCTCATACGGTCAGTCATTATTTTTGTGATATTCTCCCCTTATT 

GACTCTGTCTTGCTCCAGCACCCACATCAATGAGATTCTGCTGTTCATTATTGGAGGAGTT 

AATACCTTAGCAACTACACTGGCGGTCCTTATCTCTTATGCTTTCATTTTCTCTAGTATCCT 

TGGTATTCATTCCACTGAGGGGCAATCCAAAGCCTTTGGCACTTGTAGCTCCCATCrCTTG 

GCTGTGGGCATCTTTTTTGGGTCTATAACATTCATGTATTTCAAGCCCCCTTCCAGCACTAG 

TATGGAAAAAGAGAAGGTGTCTTCTGTGTTCTACATCACAATAATCCCCATGCTGAATCCr 

CTAATCTATAGCCTGAGGAACAAGGATGTGAAAAATGCACTGAAGAAGATGACTAGGGGA 
AGGCAGTCATCCTGA (SEQ ID NO: 16)

AOLFR9 sequences: 

MLARNNSLVTEFILAGLTDRPEFWQPFFFLFLVIYIVTMVGNLGIJT^ 
LSFTOLCYSSVFIPKMLMNFVSKXMISN^ 

x^™ SHQVCSMLTFAA ™ GLAGATAH ^^ 
NEVVVLIWGTNITWSCTn.ISYWIVTSILHIKSTQGRSKA^ 

SSGSMEQGKWSVFYTNVVPMLNPLIYSLRNKDVKVALPJCALIKIQRRNIF(S 

ATGCTGGCTAGAAACAACTCCTTAGTGACTGAATTTATTCITGCTGGATTAACAGATCGTC 

? A S A ^ CTGGCAACCC ^ CT7mCCTG " CCTAG ^^^ 
AACOTGGCTTGATCACTCTTITCGGTCTAAA^ 

C1 ZI G I G Z GA ^ AAA ^ G ^ 



CGTCATCTCTGAATGTTACATGTTGACCTCAATGGCATATGATCGCTATGT 



TTTCTCTTTTTi 

CTTTTGCTGCITACATAATGGGATTGGCTGGAGCCACGGCCCACACCGGGTGCATGTTTAG 

actcaccttctgcagtgctaatatcattaaccattacttgtgtgacatS^tccccct 

AGCTTTCCTGCACCAGCACCTATGTCAACGAGGTGGTTGTTCTCATTGTTGTGGGTACTAA 
TATCACGGTACCCAGTTGTACCATCCTCATTTCTTATGTTTTCATTGTCACTA^^ATTCTTC 

A J^J CA ±^ C £ A H^ 

^^^^^ 

TACAGTTTGAGGAACAAGGATGTCAAAGTTGCACTGAGGAAAGCTCTGATTAAAATTCAG 
AGGAGAAATATATTCTAA (SEQ ID NO: 18) 



PCT/US01/20122 

WO 01/98526 

10 AACCrTGGCTTGATC^cmCGOTCTAAAl^ mTCACTCCCAAAAT0 CTAAT 

GCITTCCTGCACCAGCACCTATGTCAAC 
ATCATGGTACCCAGTTGTACCA^TCCTCA^ 

GAAGAAATATATTCTAA (SEQ ID NO: 20) 

ATGACTCTCAGAAACAGCTGe^^ 
CAGAGCTCCAGCTCCCTCTrnOTCTA^^ 

AACTTGGGCTTGATC/^^™^™S^ 
35 CCTCTTCAAOTCTCOTATA^ 

40 GACCTTCHGTGATrCCAACGTCAJ^ 

TCTCCTGCACCAGCACCCATGTCAGTGAGCTGGTATT^ 

ATGCTATCCAGCATAAGCATCGTCATCT^ 

TCTAA (SEQ ID NO: 22) 
SlSJS^Q^MIDaOCCQCSEQIDNO^)
^? G ^ G ^ CCAC ^ TCCAGAT ^^ GT ^ TGTTTTA AA 1 1 1 [ I J ITiTGCTGATAAGA
AGAATAAAAGGAGAAATTTTGGACAGATTGTATCAGATGTTGGAAGAATCTGTTACAGTfi 
TTAGTTTATCTTTAGGTGAACCCACAACTATGGGAAGAAAT^^ 

ATTCATCCTCCTTGGACTCTCCTCTCGACCTGAGGATCAGAAGCCGCTC^ 
5 I CC £S ATCTACOTATCACAGTG ATAGGAAACCTGCTrATCATC^ 

^CTCGTCTCCAGACGCCCATGTACTTCTTTCTAAGCATCCTGTCITr^ 

ATGTGACAGTCATTATCCCTAAGATGCTGGTGAACITCTTATCAGAGACAAAGACCATCT 
^ACGGTGAGTGTCTGACCCAGATGTACTITTrcrrAGCCrTTGGAA^ 

CTGCTAGCAGCCATGGCCATTGACCGCTATGTGGCCATATGTAATCCCTTCCACTACATCA 

CCATTATGAGTCACAGATGCTGTGTCCTGCTTCTGGTTCTCTCCTTCTGCATTCCACATT^ 

CACTCCCTCCTGCACATTCTTCTGACTAATCAGCTCATCTTCTGTGCCTCCAATGTCATCCA 

TCAC 1 1 "1 "i "1CTGCGATGATCAACCAGTGCTAAAATTGTCCTGTTCCTCCCATTTTGTCAAAG 

AAATCACAGTAATGACAGAAGGCTTGGCTGTCATAATGACCCCGTTTTCATGCATCATCAT 

CTCTTATTTAAGAATCCTCATCACTGTTCTGAAGATTCCTTCAGCTGCTGGAAAGCGTAAA 

GCATTTTCTACCTGTGGCTCTCATCTCACAGTGGTGACCCTGTTTTATGGAAGCATTAGCTA

TGTCTATTTTCAGCCCCTGTCCAACTATACTGTCAAGGATCAAATAGCAACAATTATCTAr 

ACCGTACTGACTCCTATGCTAAATCCATTTATCTATAGTCTGAGGAACAAAGACATC^C 
AGGGTTTGGCAAAGTTGATGCACAGGATGAAATGTCAGTAA (SEQ ID NO: 24)

AOLFR13 sequences: 

!£?S™^ GFILLGFSD ^^ 
SFLDLCYTTGIVPQIXV^RGADKSISYGGCW^^ 

LQPGNWSQDQGKXISLFYTIITPMINPLIYTLRNKDVKGALKK^ 

ZD) 

A Z G ^ A I CAGAAAAATGGAAOT ^ 

AACAAAACCATCATTGTATTATCTCACTTGGACCCACATCITCACAATCCTATGTATTTT^ 
CTTCTCCAACCTAAGCITTTTGGATCTGTGTTACACAACCGGCATTGTTCCACAGCT 

CTCTCTAGGCTTGGGATCTACAGAATGCGTTCrCTTAGGAGTGATGGCATT^^ 
G f AG ^ GmGCAGGCCCCTCC ACTACACAGTAGTCATGCACCCrTGTCTGTATGTC3CT 
TGGCTTCTACTTCATGGGTCATTGGTTTTGCCAACTCCCTATTGCAGACGCT 
CTTTTAACACTTTGTGGAAGAAATAAATTAGAACACTTTCTTTGTGAGGTTCCTCCATTG^ 
CAAGCTTGCCTGTGTTGACACTACTATGAATGAATCTGAACTCTTCTT^ 

T^^ GTACCT ^^ 

AGGATAAAGTCAGCAACAGGGCAGAGAAAAGTGTTTGGGACATGTGGCTCCCACCTCACA 
TCAGGATCAGGGCAAGKTCATCTCTCTCTTCTACACCATCATTACACCCATGA^^^C(^C 

^T^™^ CTGAGGAACAAGGATGTGAAAGGA ^ 

AACTACGACTCCAGATGA (SEQ ID NO: 26)
AOLFR14 sequences: 

MALPLLI^PSCFASSQSLSSRMNSE1^TRAAVAPA£I^^ 
LGNMGNIALLIRMDARLHTP^^ 

^AGLADTECCLLAAMAYDRYVAIRNPLLYTTAMSQ 

RI^FCRSRKmSFFCDIPPLLAISCSDTSLNEIJXFAICGFIQTATVlW 
S^^^GGSHLTAVAMMYGTLIFM^^ 
VKEALRQTWSRFHCPGQGSQ (SEQ ID NO: 27) 

ATGGCCTTGCCATTGCTCTTATCTCCCTCCTGCTTTGCCTCTTCTCAGTCT
GA J GAACTCAGAGAACCTCACCC ^ 
A I CAGA J^ TCGCTGGGACCTGC ^ 

TGAGCCTGCTGGGAAACATGGGCATGGCGCTGCTGATCCGCATGGATGCCCGGCTCCACA 
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GGCCCC^GATGCTAGTraACCTG^ 
CCCTCCAGATGTn-GTCTTTGCAGGTCTGGCT^T^G^ 

ggcctatgaccgctacgtggc<^tcagaa^^^ 
5 cgtctatgcctgg^gctgggagcat^^ 
acac^ccctcacotcggoc™^ 

CGATATCCCTCCACTGCTGGCCAICILUIU^^ 

TCGCCATCTGTGGCTTCATCC^ 
CATCGCrGGGGCTGTGATCCACATGCGCTCGGTCG^ 

ATGAGGGAAAATAACCJAGXCCTCTAC^^^^GV^^^^^O^rcCT^^M^GTWCTGGrC^GC 

CATGATAGCCTTGGGTAAC^AG^^^ 
GTGGCCATCAGCCACCCACTTCACTACACA^^ 
TTAITGCTGGGTCTTGGGTGA^GGAAATGCCAA 
30 TAGTCTGTCCTTCTGTGGCAACCAGG^ 
CTGAAGTTATCCTGTTCTCACATCCACm 

TTTTCTCTGTGCCATTACTATGCA^^ 

AGGTTCCTTCCACCAAGGGCGTGiCTCAAGGCC^ 

TGTCTCTTTGTATTATGGTACAGTCATGGG^ 

CTCCTCGTAA (SEQ ID NO: 30) 



.YGSBMSKSVCSFUTWYVYGALTC^™^^^^^^^^^^ 






GTTCATTCTCCTGGGACTGACCAGTCGCCGG 




GGCCATCrGCAACCCTCTCCTTTATGGCAGCW 

ATCACGGTGCCTTATGTGTATGGAGCGC^CTOOIX^A ■ ^.^cc^afa 
ACCTAGCCrTCTGTGGCCCCAATGAAA^AATCAC^ACTWO^ GGCTO GCTGG 

55 s^SSS^?^^^™ 0 ^™ 
AAGATTCGCTCTACAGAGGGCAGGCAAAAAGCTTTTTCTACCTGTGGCTCCCATCTGACAG
CTGTCACTATATTCTATGCAACCCTTTTCTTCATGTATCTCAGACCCCCCTCAAAGGAATCT 

GTTGAACAGGGTAAAATGGTAGCTGTATTTTATACCACAGTAATCCCTATGCTGAACCTTA
TAATTTATAGCCTTAGAAATAAAAATGTAAAAGAAGCATTAATCAAAGAGCTGTCAATGA

AGATATACTTTTCTTAA (SEQ ID NO: 32)
AOLFR17 sequences: 

MLNITDVTEFILLGLTSRREWQVIJFIIFLVVYIITMVGMGMMVLIKVSPOLNNPMYF^ 
FVDWFSSWTPKMLHvJLFSDKKTITYAGCLVQCFFFLA.LVHVEini^AA^FDRYMAIGNF^L 
YGSKMSRWCIRLITFPYIYGFLTSLAATLWTYGLYFCGKIEINHFYCADPPLIKMACAGTFVKE 
YTMni^GINFTYSLTVUISYLFILIAILRMRSAEGRQKAFSTCGSHLTAVIIFYGTLIFM 
ESVEQGKMVAVFYTTVIPMLNPMIYSLRNKDVKKAMMKVISRSC (SEQ ID NO: 33)






ATGCTCAATTTCACCGATGTGACAGAGTTCATTCTTTTGGGGCTAACGAGCCGTCGAGAAT 

GGCAAGTTCTCTTCTTCATCATCTTTCTTGTGGTCTACATCATCACCATGGTGGGCAATATC

GGCATGATGGTGTTAATCAAGGTCAGTCCTCAGCTTAACAACCCCATGTACTTTTTCCTCA

GTCACTTGTCATTTGTTGATGTGTGGTTTTCTTCCAATGTCACCCCrrAAAATGTTGGAAAAC 

CTGTTTTCAGATAAAAAAACAATTACTTATGCTGGTTGTTTAGTACAGTGTTTCTTCTTCAT 

TGCTCTTGTCCATGTGGAAATTTTTATTCTTGCTGCGATGGCCTTTGATAGATACATGGCAA

TTGGGAATCCTCTGCTTTATGGCAGTAAAATGTCAAGGGTTGTCTGTATTCGACTGATTAC 
ITOCOTACATTTATGGTTTTCTGACGAGTO^ 

ACTTCTGTGGAAAAATTGAGATCAACCATTTCTACTGTGCAGATCCACCTCTCATCAAAAT 

GGCCTGTGCCGGGACCTTTGTAAAAGAATATACAATGATCATACTTGCCGGCATTAACTTC 

oc A ^^ TOCCTGACTGT ^^ATCATCTCTTACTTATTCATCCTCATTGCCArrCTGC 

GCGCTCAGCAGAAGGAAGGCAGAAGGCCTTTTCCACATGTGGGTCCCATCTGACAGCTGT

CATTATATTCTATGGTACTCTGATCTTCATGTATCTCAGACGTCCCACAGAGGAGTCTGTG 

GAGCAGGGGAAGATGGTGGCTGTGTTCTATACCACAGTGATCCCCATGTTGAATCCCATGA 

TCTACAGTCTGAGGAACAAGGATGTGAAAAAGGCCATGATGAAAGTGATCAGCAGATCAT 
GTTAA (SEQ ID NO: 34) 


AOLFR18 sequences: 

MSNTNGSAITEnLLGLTDCPELQSLIJ^FL\nm.VTLLGNLGMIMLMRLDSRLHTPMYF^ 
NLAFVDLCYTSNATPQMSTNIVSEKTISFAGCFTQCYIFIALLLTEFYMLAAMAYDRYVAIYDP 
LRYSVKTSRRVCICLATFPYA^GFSDGLFQAILTFRLTFCRSNVINHFy'CADPPLIKLSCSDTYVK 
EHAMFISAGFNLSSSLTIVLVSYAFILAAILPJKSAEGRHKAFSTCGSHMMAVTLFYGTLFCMYI 
RPPTDKTVEESKIIAVFYTFVSPVLNPLIYSLRNKDVKQALKNVLR (SEQ ID NO: 35) 

ATGTCCAACACAAATGGCAGTCCAATCACAGAATTCATTTTAOT 

*?™5 rcCAGTCTCTGCllmGTGCTG ^ CT ^ 

AAOTGGGCATGATAATGTTAATGAGACTGGACTCTCGCCTTCACACGCCCATGTACrTCT 
TCCTCACTAACTTAGCCTTTGTGGATTTGTGCTATACATCAAATGCAACCCCGCAGATGTC 

$^ TATCGTATCTGAGMGACWmc ^ GCTGG ™cmA^^ 

TCATTGCCCTTCTACTCACTGAGTTTTACATGCTGGCAGCAATGGCCTATGACCGCTATGT 
A * GGGCATA ^ TGACCCTCTGCGG TACAGTGTGAAAACGTCCAGGAGAGTTTGCATCT 
45 ^^ CA ^ C ^ ATG ^ATGGCTrCTCAGATGGACTCTTCCAGGCCA7CCrGACCrrcCG 

A J G j^ C I rGTTCTGATAC ™ 

CTCTCCAGCTCCCTCACCATCGTCTTGGTGTCCTATGCCTTCATTCTTGCTGCCATCCTCCG 
GATCAAATCAGCAGAGGGAAGGCACAAGGCATTCTCCACCTGTGGTTCCCATATGATGGC 
^^ A ^ CTGTTTTATGGGACTCTCTTTT ^ 

GTTGAGGAATCTAAAATAATAGCTGTCTTTTACACCTTTGTGAGTCCGGTACTTAATCCAT 

TGATCTACAGTCTGAGGAATAAAGATGTGAAGCAGGCCTTGAAGAATGTCCTGAGATGA 
(SEQ ID NO: 36) 

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5 SS^ZSMI^MSLRWMKRGLm^ (SEQ ID NO: 37) 
AGCTGTCATTGTGACCCCCTT 

TGCTCAGAATCCCCTCTGCAG^™^— T" ~ CTATrrrAGGCC TCTGTCCATGT 
CACTGTAGTGGTCCTGTTCTATGGGAGTGTCA^ 






ACAGAATTTACTCATAG (SEQ ID NO: 38) 
AOLFR20 sequences:



35 TTT 



cS^G^A^ATAGGTAGTGTC^ 



W^GGI^CTIAFLLLLISYGVIIJISLKKLSQKGRQKAHSTCSSHIT 
^SmKsVsVFYTVITPMLNPUYTLRNSEMTSAMKKL (SEQ ID NO: 41) 



TCACCCTCTCCTTCTGTAAGGACAATC

CTGAAGCTTCCCTGCAGTGACACAGCAAACATCGAGATTGTCATCATCT^ 
TTGTGATTTTGGCCAATGCCTCCGTCATCCTGATTTCCTATCTGCTCATCA 
TTGAAAGTGAAGTCTTCAGGTGGCAGGGCCAAGACnTTCTCCACATGTGCCTCTCACATCA 
5 CTGCTGTGGCCCTTTTCI 1 1 GGAGCCCTTATCTTCATGTATCTGCAAAGTGGCTCAGGCAAA 
TCTCTGGAGGAAGACAAAGTCGTGTCTGTCTTCTATACAGTGGTCATCCCCATG 

CTCTGATCTACAGCTTAAGAAACAAAGATGTAAAAGACGCCnTCAGAAAGGTCGCTAGGA 
GACTCCAGGTGTCCCTGAGCATGTAG (SEQ ID NO: 46) 



AOLFR25 sequences:

METGNLTWVSDFWLGLSQTRELQR^ 
LAVLDLCFSSVTAPKMLVDLLSEKKTISYQGC^ 
RYVTVMNTQLWVGLWATWVGGFVHSIVQI^L^^ 
SLLEFOCSNSGLLDVVWFFLLLMSYIJFILVMLRSHPGEA^ 
AJ^FTPFPMDKLVSIGHTVMTPMLNPMIYTLRNQDM (SEQ ID NO: 47)



ATGGAAACAGGGAACCTCACGTGGGTATCAGACTTTC 
GGGAGCTCCAGCGTTTCCTGTTTCTAATC 

AACATCCTTATCATCATCACAGTGACCTCTGATTCCCAGCTCCACACACCCATGTACITTCT 
GCTCCGAAACCTGGCTGTCCTAGACCTCTGTTTCTCTTCAGTCACTGCTC

TGGACCTCCTCTCTGAGAAGAAAACCATCTCTTACCAGGGCTGCATGGGTCAGATCrTC 
CTTCCACTTTTTGGGAGGTGCCATGGTCT^^ 

TTGCCATCTCCCGGCCCCTCCGCTATGTCACCGTCATGAACACTCAGCTCTGGGTGGGGCT 
GGTGGTAGCCACCTGGGTGGGAGGCTTTGTCCACTCTATTGTCCAGCTGGCTCTGATGCT 

CCACTGCCCTTCTGTGGCCCCAACATTTTGGATAACTTCTACTGTGATGTTCCCCAAGTACT
GAGACHTGCCTGCACTGACACCTCACTGCTGGAGTTCCTCAAGATCTCCAACAGTGGGCTG 
CTGGATGTCGTCTGGTT(m , CCTCCTCCTGATGTCCrACrTATTCATCCT 
GTCACATCCAGGGGAGGCAAGAAGGAAGGCAGCTTCCACCTGCACCACCCACATCATCGT 
GGTTTCCATGATCTTCGTTCCAAGCATTTACCTCTATGCCCGGCCCTTCACT 

TGGACAAGCTTGTGTCCATCGGCCACACAGTCATGACCCCCATGCTCAACCCCATGATCTA
TACCCTGAGGAACCAGGACATGCAGGCAGCAGTGAGAAGATTAGGGAGACACCGGCTGGT 
TTGA (SEQ ID NO: 48) 



AOLFR26 sequences: 

3 5 MAAKNSS VTEFILEGLTHQPGLRIPIJ^FIJFLGF YTVTW 
LroFCFSTTITPKMLMSFVSRKNIISFTGC 
VTMSCQVCLLLLLGAYGMGFAGAMAHTGSIMNLT^ 
ELVVFIWAVDVGMPIVTVFISYALILSSILHNSSTEGRSKAFSTCSSmiW 
LSILPLEQGKVSSLFYTnWVLNPLIYSLRNlG)VKVALRRTLGRKIFS (SEQ ID NO: 49) 


ATGGCAGCCAAAAACTCTTCTGTGACAGAGTTTA 
GACTGCGGATCCCCCTCTTCTTCCTGTTTCTGGGTTTCT 
CCTOTGCTTGATAACCCTGA 
TTTTTAACCTCTCITTAATAGATTTCT 
AGTTTTGTCTCAAGGAAGAACATCATTTCCTTCACAGGGTGTATGACTCAGCT

CTGC I "I CI "1 "J GTCGTCTCTG AGTCCTTCATCCTGTCAGCG ATGGCGTATGACCGCTACGTGG 
CCATCTGTAACCCACTGTTGTACACAGTCACCATGT^ 

TTGGGTGCCTATGGGATGGGGTTTGCTGGGGCCATGGCCCACACAGGAAGCATAATGAAC 
CTGACCTTCTGTGCTGACAACCTTGTCAATCATTrCATGTGTGACATCCT 
GCTCTCCTGCAACAGCTCTTACATGAATGAGCTGGTGGTCTTTATTGTGGTGGCT
GTTGGAATGCCCATTGTCACTGTCTTTATTTCTTATGCCCTCATCCT 
CAACAGTTCTACAGAAGGCAGGTCCAAAGCCTTTAGTACTTGCAGTTCCCAC^ 
GTTTCTCTTTTCTTTGGTTCTGGTG 

GAGCAAGGGAAAGTGTCCTCCCTGTTCTATACCATAATAGTCCCCGTGTTAAACCCATTAA 
55 TCTATAG(m , GAGGAACAAGGATGTCAAAGTTGCCCrGAGGAGAACTTTGGGC^ 
TCTTTTCTTAA (SEQ ID NO: 50) 
AOLFR27 sequences:

MPSQNYSIISEFNLFGFSAFPQHLLPILFLLYLLMFLFTLLGNLLIMATIWIEHRLHTPMYLFLCTL
SVSEILFTVAITPRMLADLLSTHHSITFVACANQMFFSFMFGFTHSFLLLVMGYDRYVAICHPLR
YNVLMSPRDCAHLVACTWAGGSVMGMMVTTIVFHLTFCGSNVIHHFFCHVLSLLKLACENKT
SSVIMGVMLVCVTALIGCLFLIILSYVFIVAAILRIPSAEGRHKTFSTCVSHLTVVVTHYSFASFIY
LKPKGLHSMYSDALMATTYTVFTPFLSPIIFSLRNKELKNAINKNFYRKFCPPSS (SEQ ID NO: 51)

ATGCCTAGTCAGAACTATAGCATCATATCTGAATTTAACCTCTTTGGCTTCTCAGCCTTCCC
CCAGCACCTCCTGCCCATCTTGTTCCTGCTGTACCTCCTGATGTTCCTGTTCACATTGCTGG
GCAACCTTCTCATCATGGCCACAATCTGGATTGAACACAGACTCCACACACCCATGTACCT
CTTCTTGTGCACCCTCTCCGTCTCTGAGATTCTGTTCACTGTTGCCATCACCCCTCGCATGC
TGGCTGATCTGCTTTCCACCCATCATTCCATCACCTTTGTGGCTTGTGCCAACCAGATGTTC
TTCTCCTTCATGTTTGGCTTCACTCACTCCTTCCTTCTCCTGGTCATGGGCTATGATCGCTA
TGTGGCCATCTGCCACCCACTGCGTTACAATGTGCTCATGAGCCCCCGTGACTGTGCCCAT
CTTGTGGCCTGTACCTGGGCTGGTGGCTCAGTCATGGGGATGATGGTGACAACGATAGTTT
TCCACCTCACTTTCTGTGGGTCTAATGTGATCCACCATTTTTTCTGTCATGTGCTTTCCCTCT
TGAAGTTGGCCTGTGAAAACAAGACATCATCTGTCATCATGGGTGTGATGCTGGTGTGTGT
CACAGCCCTGATAGGCTGTTTATTCCTCATCATCCTCTCCTATGTCTTCATTGTGGCTGCCA
TCTTGAGGATTCCCTCTGCCGAAGGCCGGCACAAGACATTTTCTACGTGTGTATCCCACCT
CACTGTGGTGGTCACGCACTATAGTTTTGCCTCCTTTATCTACCTCAAGCCCAAGGGCCTCC
ATTCTATGTACAGTGACGCCTTGATGGCCACCACCTATACTGTCTTCACCCCCTTCCTTAGC
CCAATCATTTTCAGCCTAAGGAACAAGGAGCTGAAGAATGCCATAAATAAAAACTTTTACA
GAAAATTCTGTCCTCCAAGTTCCTGA (SEQ ID NO: 52)

AOLFR28 sequences:
MPNFTDVTEFTLLGLTCRQELQVLFFVVFLAVYMITLLGNIGMIILISISPQLQSPMYFFLSHLSF
ADVCFSSNVTPKMLENLLSETKTISYVGCLVQCYFFIAVVHVEVYILAVMAFDRYMAGCXPLL
YGSKMSRTVCVRLISVXYXYGFSVSLICTLWTYGLYFCGNFEINHFYCADPPLIQIACGRVHIKE
ITMIVIAGINFTYSLSVVLISYTLIVVAVLRMRSADGRRKAFSTCGSHLTAVSMFYGTPIFMYLR
RPTEESVEQGKMVAVFYTTVIPMLNPMIYSLRNKDVKEAVNKAITKTYVRQ (SEQ ID NO: 53)

ATGCCTAATTTCACGGATGTGACAGAATTTACTCTCCTGGGGCTGACCTGTCGTCAGGAGC
TACAGGTTCTCTTTTTTGTGGTGTTCCTAGCGGTTTACATGATCACTCTGTTGGGAAATATT
GGTATGATCATTTTGATTAGCATCAGTCCTCAGCTTCAGAGTCCCATGTACTTTTTCCTGAG
TCATCTGTCTTTTGCGGACGTGTGCTTCTCCTCCAACGTTACCCCCAAAATGCTGGAAAACT
TATTATCAGAGACAAAAACCATTTCCTATGTGGGATGCTTGGTGCAGTGCTACTTTTTCAT
TGCCGTTGTCCACGTGGAGGTCTATATCCTGGCTGTGATGGCCTTTGACAGGTACATGGCC
GGCTGCAANCCTCTGCTTTATGGCAGTAAAATGTCTAGGACTGTGTGTGTTCGGCTCATCT
CTGTGNNNTATGNNTATGGATTCTCTGTCAGCCTAATATGCACACTATGGACTTATGGCTT
ATACTTCTGTGGAAACTTTGAAATCAATCACTTCTATTGTGCAGATCCCCCTCTCATCCAGA
TTGCCTGTGGGAGAGTGCACATCAAAGAAATCACAATGATTGTTATTGCTGGAATTAACTT
CACATATTCCCTCTCGGTGGTCCTCATCTCCTACACTCTCATTGTAGTAGCTGTGCTACGCA
TGCGCTCTGCCGATGGCAGGAGGAAGGCGTTCTCCACCTGTGGGTCCCACTTGACGGCTGT
TTCTATGTTTTATGGGACCCCCATCTTCATGTATCTCAGGAGACCCACTGAGGAATCCGTA
GAGCAGGGCAAAATGGTGGCTGTGTTTTACACCACAGTAATTCCTATGTTGAATCCCATGA
TCTACAGTCTGAGAAATAAGGATGTAAAAGAAGCAGTCAACAAAGCAATCACCAAGACAT
ATGTGAGGCAGTAA (SEQ ID NO: 54)


AOLFR29 sequences: 

MMSFAPNASHSPWLLLGFSRAMSYTLLFFLFLATY1.TTILGNVTLVLLISWDSRLHSPMYYLLR 
GLSVIDMGLSTVTLPQLLAHLVSHYPTIPAARCLAQFFFFYAFGVTDTLVIAVMALDRYVAICD 
PLHYALVMNHQRCACLLALSWWSILHTMLRVGLVLPLCWTGDAGGNVNLPHFFCDHRPLLR 
55 ASCSDIHSNELAIFFEGGFO^LGPCALIVLSYVRIGAAIIJILPSAAGRRRAVSTCGSHLTMVGFL 
TCCTCTGCCATTTGGCCTTTGTAGACATTGGGTACTCCTCATCAGTCACACCTGTCATGCTC

ATGAGCTTCCTAAGGAAAGAAACCTCTCTCCCTGTTGCTGGTTGTGTGGCCCAGCTCTGTT 

CTGTAGTGACGTTTGGTACGGCCGAGTGCTTCCTGCTGGCTGCCATGGCCTATGATCGCTA 

TGTGGCCATCTGCTCACCCCTGCTCTACTCTACCTGCATGTCCCCTGGAGTCTGCATCATCT 

TAGTGGGCATGTCCTACCTGGGTGGATGTGTGAATGCTTGGACATTCATTGGCTGCTTATT

AAGACTGTCCTTCTGTGGGCCAAATAAAGTCAATCACTTTTTCTGTGACTATTCACCACTTT

TGAAGCTTGCTTGTTCCCATGATTTTACnTTTGAAATAATTCCAGCTATCTCTTCTGGATCT 

ATCATTGTGGCCACTGTGTGTGTCATAGCCATATCCTACATCTATATCCTCATCACCATCCT 

GAAGATGCACTCCACCAAGGGCCGCCACAAGGCCTTCTCCACCTGCACCTCCCACCTCACT 

GCAGTCACTCTGTTCTATGGGACCATTACCTTCATTTATGTGATGCCCAAGTCCAGCTACTC

AACTGACCAGAACAAGGTGGTGTCTGTGTTCTACACCGTGGTGATTCCCATGTTGAACCCC 

CTGATCTACAGCCTCAGGAACAAGGAGATTAAGGGGGCTCTGAAGAGAGAGCTTAGAATA 
AAAATATTTTCTTGA (SEQ ID NO: 60) 



AOLFR32 sequences:

MNSLKDGNHTALTGFILLGLTDDPILRVILFMIILSGhlLSm^ 

AYSSSVTP>nWL\WLVERNTVSYLGCAIQLGSAAFFATVECVLLAAMAYDRFVAICSPLLYSTK 
MSTQVSVQLLLVVYIAGFLIAVSYTTSFYFLLFCGPNQVNHFFCDFAPLLELSCSDISVSTVVLSF
SSGSIIVVTVCVIAVCYIYILITILKMRSTEGHHKAFSTCTSHLTVVTIJ^GTITFIYVMPN^ 
DQNKVVSVLYTVVIPMLNPLIYSLRNKEIKGALKRELVRKILSHDACYFSRTSNNDIT (SEQ ID NO: 61)



ATGAATTCCCTGAAGGACGGGAATCACACCGCTCTGACGGGGTTCATCCTATTGGGCTTAA 
CAGATGATCCAATCCTTCGAGTCATCCTCTTCATGATCATCCTATCTGGTAATCTCAGCATA 

ATTATTCTTATCAGAATTTCTTCTCAGCTCCATCATCCTATGTATTTCTTTCTGAGCCACTT
GGCTTTTGCTGACATGGCCTATTCATCTTCTGTCACACCCAACATGCTTGTAAACTTCCTGG 
TGGAGAGAAATACAGTCTCCTACCTTGGATGTGCCATCCAGCTTGGTTCAGCGGCTTTCTT 
TGCAACAGTCGAATGCGTCCTTCTGGCTGCCATGGCCTATGACCGCTTTGTGGCAATTTGC 
AGTCCACTGCTTTATTCAACCAAAATGTCCACACAAGTCAGTGTCCAGCTACTCTTAGTAG 

TTTACATAGCTGGTTTTCTCATTGCTGTCTCCTATACTACTT

GTGGACCAAATCAAGTCAATCATTTTTTCTGTGATTTCGCTCCCTTACTTGAACTCTCCTGT 
TCTGATATCAGTGTCTCCACAGTTGTTCTCTCATTTTCTTCTGGATCCATCATTGTGGTCAC
TGTGTGTGTCATAGCCGTCTGCTACATCTATATCCTCATCACCATCCTGAAGATGCGCTCCA 
CTGAGGGGCACCACAAGGCCTTCTCCACCTGCACTTCCCACCTCACTGTGGTTACCCTGTT 
CTATGGGACCATTACCTTCATTTATGTGATGCCCAATTTTAGCTACTCAACTGACCAGAAC
AAGGTGGTGTCTGTGTTGTACACAGTGGTGATTCCCATGTTGAACCCCCTGATCTACAGCC 
TCAGGAACAAGGAGATTAAGGGGGCTCTGAAGAGAGAGCTTGTTAGAAAAATACTTTCTC 
ATGATGCTTGTTATTTTAGTAGAACTTCAAATAATGATATTACATAG (SEQ ID NO: 62) 

AOLFR34 sequences:

MLEGVEHLLLLLLLTDVNSKELQSGNQTSVSHFILVGLHHPPQLGAPLFLAFLVIYLLTVSGNG
LIILTVLVDIRLHRPMCLFLCHLSFLDMTISCAIVPKMLAGFLLGSRIISFGGCVIQLFSFHFLGCT
ECFLYTLMAYDRFLAICKPLHYATIMTHRVCNSIALGTWLGGTIHSLFQTSFVFRLPFCGPNRV 
DYIFCDn>AMLRLACADTAINELVTFADIGFLALTCFMLn.TSYGYIVAAJLRIPSADGRRNAFST 
45 CAAHLTWIVYYWCTFIYLRPCSQEPLDGWAVFYTVITPLLNSir^ 
HKEVQPH (SEQ ID NO: 63) 



ATGTTAGAGGGTGTTGAGCATCTCCTTCTGCTACTTCTTTTGACAGATGTGAACAGCAAGG
AACTGCAAAGTGGAAACCAGACTTCTGTGTCTCACTTCATTTTGGTGGGCCTGCACCACCC
ACCACAGCTGGGAGCGCCACTCTTCTTAGCTTTCCTTGTCATCTATCTCCTCACTGTTTCTG
GAAATGGGCTCATCATCCTCACTGTCTTAGTGGACATCCGGCTCCATCGTCCCATGTGCTT
GTTCCTGTGTCACCTCTCCTTCTTGGACATGACCATTTCTTGTGCTATTGTCCCCAAGATGC
TGGCTGGCTTTCTCTTGGGTAGTAGGATTATCTCCTTTGGGGGCTGTGTAATCCAACTATTT
TCTTTCCATTTCCTGGGCTGTACTGAGTGCTTCCTTTACACACTCATGGCTTATGACCGTTT
CCTTGCCATTTGTAAGCCCTTACACTATGCTACCATCATGACCCACAGAGTCTGTAACTCCC
TGGCTTTAGGCACCTGGCTGGGAGGGACTATCCATTCACTTTTCCAAACAAGTTTTGTATT
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?aSca^ctg?a^^^ 

GTGCAGCCTCACTGA (SEQ ID NO: 64) 




65) 

ATGGAGCCGCTCAACAGAACAGAGGTGTCCGAGTT 1 



CTTTCTGAAAGGATTTTCTGGCTACC 



, ^ t< \TCTGCTCTTCCCTCTGTCCTCAGCCATCT 



GCAGGAGTCGGGCCTCCAGGTGA (SEQ ID NO: 66) 
40 SSd^I^ 



rGGTTTTCrTCCCATTTCAGGGATCCTTTTGTCTTAC 
TATAAAATTGTCCCCTCCATTCTAAGGATTTCATCATCAGATGGGAAGTACAAA 

^^^^ 

55 ?Jctg?gSagggtgtgcaacaaa^ 
TGTGGTTGAGAAAGGGCAACCACATTCAATCCCTACATCTGCAAATCCTGCCCCTTAG
(SEQ ID NO: 68) 



AOLFR37 sequences: 
MEKANETSPVMGFVLLRLSAHPELEKTFFVLILLMYLVILLGNGV

NLSFIJ)ICFTTSSVPLVLDSFLTPQETISFSACAVQMALSFAMAGTECLLLSM 

LRYSVIMSKAAYMPMAASSWAJGGAASVVHT^ 

INVISMEVTNWLGWVLFISFSYWIITTILRIPSAEGRKKVFSTCS 

KPKSKDSMGADKEDLSDKLIPLFYGVVTPMLNPIIYSLRNKDVKAAVRRLLRPKGFTQ (SEQ ID NO: 69)



ATGGAAAAAGCCAATGAGACCTCCCCTGTGATGGGGTTCGTTCTCCTGAGGCTCTCTGCCC 
ACCCAGAGCTGGAAAAGACATTCTTCGTGCTCATCCTGCTGATGTACCTCGTGATCCTGCT
GGGCAATGGGGTCCTCATCCTGGTGACCATCCTTGACTCCCGCCTGCACACGCCCATGTAC 

1 5 TTCITCCTAGGGAACCTCTCCTTCCT 

CCTGGACAGCTTTTTGACTCCCCAGGAAACCATCTCCTTCTCAGCCTGTGCTGTGCAGATG
GCACTCTCCTTTGCCATGGCAGGAACAGAGTGCTTGCTCCTGAGCATGATGG 
GCTATGTGGCCATCTGCAACCCCCTTAGGTACTCCGTGATCATGAGCAAGGCTGCCTACAT 
GCCCATGGCTGCCAGCTCCTGGGCTATTGGTGGTGCTGCTTCCGTGGTACACACATCCTTG 

GCAATTCAGCTGCCCTTCTGTGGAGACAATGTCATCAACCACTTCACCTGTGAGATTCTC

CTGTTCTAAAGTTGGCCTGTGCTGACATTTCCATCAATGTGATCAGCATGGAGGTGACGAA 
TGTGATCTTCCTAGGAGTCCCGGTTCTGTTCATCTCTTTCTCCTATGTCTTCATCATCACCA 
CCATCCTGAGGATCCCCTCAGCTGAGGGGAGGAAAAAGGTCTTCTCCACCTGCTCTGCCCA 
CCTCACCGTGGTGATCGTCTTCTACGGGACCTTATTCTTCATGTATGGGAAGCCTAAGTCT

AAGGACTCCATGGGAGCAGACAAAGAGGATCTTTCAGACAAACTCATCCCCCTTTTCT

GGGTGGTGACCCCGATGCTCAACCCCATCATCTATAGCCTGAGGAACAAGGATGTGAAGG 
CTGCTGTGAGGAGACTGCTGAGACCAAAAGGCTTCACTCAGTGA (SEQ ID NO: 70) 



AOLFR38 sequences: 

MYLVTVLRNLLIILAVSSDSHLHTPMCFFLSNLCWA
LTQMSFFVTJFACIEDMLLTVMAYDRFVAICH^ 
IVLQFTFFKhTV^ISNFVCDPSQLLN^ 

SDRKSKAFSTCGSHLAWCLFYGTGIGVTLTSAVSPPPRNGWASVMYAVV^ 
NRDIQSALWRLRSRTVESHDLLSQDLLHPFSCVGEKGQPH (SEQ ID NO: 71) 


ATGTACCTGGTCACGGTGCTGAGGAACCTGCTCATCATCCTGGCTGTCAGCTCTGACTCCC 
ACCTCCACACCCCCATGTGCTTCTTCCT 

TCGGCCATGGTTCCCAAGATGATTGTGGACATGCAGTCGCATAGCAGAGTCATCTCTTATG 
CGGGCTGCCTGACACAGATGTCTTTCTTTGTCCTTTTTGCA^ 
ACAGTGATGGCCTATGACCGATTTGTGGCCATCTGTCACCCCCTGCACTACCCAGTCATCA
TGAATCCTCACCTTGGTGTCITC^ 

CAGCTGCACAGTTGGATTGTGTTACAATTCACCTTCTTCAAGAATGTGGAAATCTC

TTGTCTGTGACCCATCTCAACTTCTCAACCTTGCCTGTTCTGACAGTGTCATCAA 

TTCATATATTTAGATAGTATTATGTTTGGTTTTCTTCCCAT^ 

GCTAACAATGTCCCCTCCATTCTAAGAATTTC

CCACCTGTGGCTCTCACCTGGCAGTTGTTTGCTTATTTTATGGAACAGGCATTGGCGTGT^ 
CCTGACTTCAGCTGTGTCACCACCCCCCAGGAATGGTGTGGTGGCATCAGTGATGTACGCT 
GTGGTCACCCCCATGCTGAACCCTTTCATCTACAGCCTGAGAAATAGGGACATTCAAAGTC 
CCCTGTGGAGGCTGCGCAGCAGAACAGTCGAATCTCATGATCTGTTATCTCAAGATCTGCT 

CCATCCTTTTTCTTGTGTGGGTGAGAAAGGTCAACCACATTAA (SEQ ID NO: 72)



AOLFR39 sequences: 

MGVXNHSTVTEFLI^GLTEQAELQLPLFCLFLGIYTVTVVGNI^ 
^FLDFCYSSVITPKMLSGI^CRDRSISYSGCMIQLFFFCVC^ 
55 LYRYIMSPRVCSLLVAAWSVGFTOAVIHGGCnjR^ 
LIFV1GGFNMVATSLTIIISYAFILTSILMHSKKGRCKAFSTCSSHLTAVLMPTGSLM
SSSLTQEK^SVFYTTVILMLNPLIYS^ (SEQ ID NO: 73)

ATGGGTGTAAAAAACCATTCCACAGTGACTGAGTTTCTTC^ 
CAGAGCTTCAGCTGCCCCTCTTCTGCCTCTTCTTAGGAATTTACACAGTTACT
AAACCTCAGCATGATCTCAATTATTAGGCTGAATCGTCAACTTCATACCC^ 

TTCCTG^G^AG'^^ 

ATr^^rr^TTTT^TGCAGAGATAGATCCATCTCCT 
^CTGT™^ 






;;^ A "^i-TfifirCACAAGCCTAACAATCATTATTTCATATGCrTTTATCCTCACCAGCATCCT 

1 < ^gc1?c?actcta^gggca^ 
15 gSgto^gttt^ggt^ 
aScSggag^ 

^gatatatagtctgaggaacaatgaagtaagaaatgctctgatgaaacttttaagaaga 

AAAATATCTTTATCTCCAGGATAA (SEQ ID NO: 74) 



AOLFR40 sequences:

^^^^^^^^^^^ 

CT^SSSI^ ^VF^WI^VVYTLl^VKICAUKLKNGS VFAQOE (SEQ ID NO: 
75) 



ITTTT 



rru r rn-GTCCTTCATTGACATGTGGTTCTCC ACTGTCACGGTGCCCAAAATGCTGATCAC 



TPr A rT-rrPTOGGGAGC ACCGAGTGTTTCCTCTACAC AGTCATGTCCTATGATCGCTACCT 
GGC^A^A^^^ 

OCCACCGGCAOTGGCTCAGTGGCTCTCTGCAGTCTGCTGTCC 
G^CTGGCCTOTGCAGA 

CTTnCATGGGGT^GTGGC 

Scc?gaga^^^ 
TTGCTCAGGGTGAATAG (SEQ ID NO: 76)



AOLFR41 sequences: 

MNPENWTQVTSFVLLGFPSSHLIQFLWLGLMVTYTVT^ 

yetSghvcsql^ 

(SEQ ID NO: 77) 

a xr a A rrCTGAAAACTGGACTCAGGTAACAAGCTTTGTCCTTCTGGGTTTCCCCAGTAGCC 

acc^a?a?ag^^ 
cSctgctaaS^ 
TTCCTGCGGAATTTCTCCTTCCTGGAGCTGTTGCTGGTAACTGTTGTGGTTCCCAAGATGCT
TGTCGTCATCCTCACGGGGGATCACACCATCTCATTTGTCAGCTGCATCATCCAGTCCTACC 
TCTACTTCTTTCTAGGCACCACTGACTTCTTCCTCTTGGCCGTCATGTCTCTGGATCGTTAC
CTGGCAATCTGCCGACCACTCCGCTATGAGACCCTGATGAATGGCCATGTCTGTTCCCAAC 
TAGTGCTGGCCTCCTGGCTAGCTGGATTCCTCTGGGTCCTTTGCCCCACTGTCCTCATGGCC
AGCCTGCCTTTCTGTGGCCCCAATGGfATTGACCACTTCTTTCGTGACAGTTGGCCCTTGCT 
CAGGCTTTCTTGTGGGGACACCCACCTGCTGAAACTGGTGGCTTTCATGCTCTCTACG7TG 
GTGTTACTGGGCTCACTGGCTCTGACCTCAGTTTCCTATGCCTGCATTCTTGCCACTGTTCT 
CAGGGCCCCTACAGCTGCTGAGCGAAGGAAAGCGTTTTCCACTTGCGCCTCGCATCTTACA 
GTGGTGGTCATCATCTATGGCAGTTCCATCTTTCTCTACATTCGTATGTCAGAGGCTCAGTC
CAAACTGCTCAACAAAGGTGCCTCCGTCCTGAGCTGCATCATCACACCCCTCTTGAACCCA 
TTCATCTTCACTCTCCGCAATGACAAGGTGCAGCAAGCACTGAGAGAAGCCTTGGGGTGGC 
CCAGGCTCACTGCTGTGATGAAACTGAGGGTCACAAGTCAAAGGAAATGA (SEQ ID NO: 78)


AOLFR42 sequences: 

MOTANHSQVAGFVLLGLSQVWELRFVTFTWSAVW^ 

GNLSFLDFCYSSITAPRMLVDLLSGNPTISFGGCLTQIJFFFHFIGGIKIFLLTVMAYDRY1AJSQPL 
HYTLlMNQWCALLMAASWVGGFmSr/QIALTIQLPFCGPDKLDNFYCDWQLIKLACTDTFV 
20 LELLMVSNNGLVTLMCFLVLLGSYTALLVMOlSHSREGRSKALSTCASHMVVTLIFVPCryvy 
TRPFRTFPMDKAVSVLYTIVTPMLNPArYTLRNKEVIMAMKKLWPJUCKDPIGP^ 
(SEQ ID NO: 79) 



ATGAATCCAGCAAATCATTCCCAGGTGGCAGGATTTGTTCTACTGGGGCTCTCTCAGGTTT 
25 GGGAGCTTCGGTTTGTTTTCTTCACTGTTTTCTCTGCTGTGTATT^ATGACTGTAGTGGGA 
AACCTTCnrTATTGTGGTCATAGTGACCTCCGACCCACACCTGCACACAACCATGTATTTTCT 
CTTGGGCAATCTTTCTTTCCTGGACTTTTGCTACTCTTCCATCACAGCACCTAGGATGCTGG 
TTGACTTGCTCTCAGGCAACCCTACCATTTCCTTTGGTGGATGCCTGACTCAACTCTTCTTC
TTCCACTTCATTGGAGGCATCAAGATCTTGCTGCTGACTGTCATGGCGTATGACCGCTACA 
TTGCCATTTCCCAGCCCCTGCACTACACGCTCATTATGAATCAGACTGTCTGTGCACTCCTT
ATGGCAGCCTCCTGGGTGGGGGGCTTCATCCACTCCATAGTAGAGATTGCATTGACTATCC 
AGCTGCCATTCTGTGGGCCTGACAAGCTGGACAACTTTTATTGTGATGTGCCTCAGCTGAT 
CAAATTGGCCTGCACAGATACCTTTGTCTTAGAGCTTTTAATGGTGTCTAACAATGGCCTG 
GTGACCCTGATGTGTTTTCTGGTGCTTCTGGGATCGTACACAGCACTGCTAGTCATGCTCC 
GAAGCCACTCACGGGAGGGCCGCAGCAAGGCCCTGTCTACCTGTGCCTCTCACATTGCTGT
GGTGACCTTAATCTTTGTGCCTTGCATCTACGTCTATACAAGGCCTTTTCGGACATTCCCCA 
TGGACAAGGCCGTCTCTGTGCTATACACAATTGTCACCCCCATGCTGAATCCTGCCATCTA 
TACCCTGAGAAACAAGGAAGTGATCATGGCCATGAAGAAGCTGTGGAGGAGGAAAAAGG 
ACCCTATTGGTCCCCTGGAGCACAGACCCTTACATTAG (SEQ ID NO: 80) 


AOLFR43 sequences: 

MQKPQLLWIL\TSNGhfLVHAAYFLLVGIPGLGPTIHFWLAFPLCFMYALATLGNLTIVLim 
PvRLHEPMYIJLAMLSTIDLVLSSITMPKMASLFLMGIQEIEFNI^^ 

MAFDRFVAICHPLRHASVLTGCWAKIGLSALTRGFVFFFPLPF1LKWLSYCQTHTVTHSFCLHQ 

45 DIMKl^CIDTRVNVVYGLniLSVMGVDSLFIGFSYILILWAVLEI^SRRAALKAFNTCISHLCAV 

LVFY^LIGLSVVHRLGGPTSLLHVVMANTYLLLPPVVNPLVYGAKTKJEICSRV^ 
(SEQ ID NO: 81) 



ATGCAGAAGCCCCAGCTCTTGGTCCCTATCATAGCCACTTCAAATGGAAATCTGGTCCACG
CAGCATACTTCCTTTTGGTGGGTATCCCTGGCCTGGGGCCTACCATACACTTTTGGCTGGCT
TTCCCACTGTGTTTTATGTATGCCTTGGCCACCCTGGGTAACCTGACCATTGTCCTCATCAT
TCGTGTGGAGAGGCGACTGCATGAGCCCATGTACCTCTTCCTGGCCATGCTTTCCACTATT
GACCTAGTCCTCTCCTCTATCACCATGCCCAAGATGGCCAGTCTTTTCCTGATGGGCATCCA
GGAGATCGAGTTCAACATTTGCCTGGCCCAGATGTTCCTTATCCATGCTCTGTCAGCCGTG
GAGTCAGCTGTCCTGCTGGCCATGGCTTTTGACCGCTTTGTGGCCATTTGCCACCCATTGC
GCCATGCTTCTGTGCTGACAGGGTGTACTGTGGCCAAGATTGGACTATCTGCCCTGACCAG
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rATGGACTCTTCATCATCCTCTCAGTCATGGGTGTGGACTCTCTCT 



rACCCTC A A TGTGGTTT ATGG ACTCTTC A 1 UAltLlLi a w\ i vjvjvj * v_, , 
?r^G^^CTCATATATCCT^^ 

cctStoggc^cggtggtgcataggctgggtggtcccaccto 
a^g^taS^ 

SSEJ^oto^ (SEQ id 



NO: 82) 






AOLFR44 sequences: 
(SEQ ID NO: 83) 



£™S?g^gSgcca^ 

rriTTC^GGTTTG^ 

?^SctcTcagcca^g^ 
Satc?gSa^ccactgcgccatgct^^ 

r^TCCTGGCTGTGGTGCGC^ 



a^ctgcotcI^^ 

TOCCATCATCT^TGGTGCCAAAACCAAACAGATCAGAACACGGGTGC^ 

Itcagctgt^^^ (SEQ ID NO: 84)

AOLFR45 sequences: 

40 s?n^giavawsv™ 

l^^A^APRVHILIAIFYLLFPPMVNPIIYGVKTKQIREYVLSLFQI^ (SEQ ID NO: 
85) 






CATGCCTGGATCTCCATCCCCTTCTGCTTTGCTTATACTCTGG 
'CCTTCTCTTCATTATCCAGGCTGATGCAGCCCTCCATGAACCC 
TGGCAACCATTGACTTGGTTCTTTCTTCTACAACGCTGCC 



GGAGCCATGCTTCCCTCTAATAT^ 

rrrTGCTAGGCAACTGTACCCTTCTCTTCATTATCCAGGCTGATGCAGCC 






ATP ArCA^GATTGGCATGGCTGCTGTGGCCCGGG 
rASfiCTGTGGTAAGG^ 
TCAGGCAGTTCTCCAGCTTGCCTCTCAGGAGGCCCGCTACAAAGCATTTGGGACATGTGTG
TCTCACATAGGTGCCATCCTGTCCACCTACACTCCAGTAGTCATCTCTTCAGTCATGCACCG 
TGTAGCCCGCCATGCTGCCCCTCGTGTCCACATACTCCTTGCTATTTTCTATCTCCTTTTCC
CACCCATGGTCAATCCTATCATATATGGAGTCAAGACCAAGCAGATTCGTGAGTATGTGCT 
CAGTCTATTCCAGAGAAAGAACATGTAGATGGATAGTTCTCTTTTTTTATCCCACTTGCCA
AG I^ GAGAATGCTGGA ^ GG GGTTGAGGGGAA^ 

ATCTTTGACAATTCTCTAGTATGATAAGGAAAATGAGGTTTCATTCCTCACAGATCTACGA 

GTCAGGTCAAACCAGGAGTGCACCTATAGTCTGGTCTGATAGTAGAGGTTTGACCTTCCCA 

TTGTCATAGACTCATCACATGGCTAAGGAAGACAAACCTCTCAAAGTGGTATTGTAATCTG 

GGTGAAAGACAGTAGGACCTTTATTGGCTGAGATTGGCCCAAACAGCTGAGTC (SEQ ID NO: 86)






AOLFR46 sequences: 

MNIKHCGWHMmTWLNIREDDDSDFKNFIGQIQGLSGNPHSTTSRMYFLCTCTSLLGFKXOTWV 
1 5 SRLDOO.YMASPNNDSTAPVSEFmCITW^ 

LHQPLYYLLSLLSLLDIVLCLTVIPKVLAIFWFDLRSISFPACFLQMFIMNSFLTMESCTFMVMA
YDRYVAICHPLRYPSIITDQFVARAVVFVIARNAFVSLPVPMLSARLRYCAGNIIKNCICSNLSVS
KLSCDDITFNQLYQFVAGWTLLGSDLILIVISYSFILKVVLRIKAEGAVAKALSTCGSHFILILFFS
TVLLVLVITNLARKRIPPDVPILLNILHHLIPPALNPIVYGVRTKEIKQGIQNLLKRL (SEQ ID NO:

ATGAATATAAAACATTGTGGCTGGCATATGATACATACTTGGTTAAATATAAGGGAGGAT
GATGACAGTGATTTTAAAAACTTTATTGGACAGATACAGGGCCTCAGTGGAAACCCACACT
CTACTACGTCTAGAATGTACTTTTTATGTTTCTGTACTTCTCTACTAGGTTTTAAGGTACAC
TGGGTCTCCAGATTGATCANGAAACTTTACATGGCATCTCCCAACAATGACTCCACTGCCC
CAGTCTCTGAATTCCTCCTCATCTGCTTCCCCAACTTCCAGAGCTGGCAGCACTGGTTGTCT
CTGCCCCTCAGCCTTCTCTTCCTCCTGGCCATGGGAGCTAACACCACCCTCCTGATCACCAT
CCAGCTGGAGGCCTCTCTGCACCAGCCCCTGTACTACCTGCTCAGCCTCCTCTCCCTGCTGG
ACATCGTGCTCTGCCTCACCGTCATCCCCAAGGTCCTGGCCATCTTCTGGTTTGACCTCAGG
TCGATCAGCTTCCCAGCCTGCTTCCTCCAGATGTTCATCATGAACAGTTTTTTGACCATGGA
GTCCTGCACGTTCATGGTCATGGCCTATGACCGTTATGTGGCCATCTGCCATCCATTGAGA
TACCCGTCTATCATCACTGACCAGTTTGTGGCTAGGGCCGTGGTCTTTGTTATAGCCCGGA
ATGCCTTTGTTTCTCTTCCTGTTCCCATGCTTTCTGCCAGGCTCAGATACTGTGCAGGAAAC
ATAATCAAGAACTGCATCTGCAGTAACCTGTCTGTGTCCAAACTCTCTTGTGATGACATCA
CTTTCAATCAGCTCTACCAGTTTGTGGCAGGCTGGACTCTGTTGGGCTCTGATCTTATCCTT
ATTGTTATCTCCTATTCTTTTATATTGAAAGTTGTGCTTAGGATCAAGGCCGAGGGTGCTGT
GGCCAAGGCCTTGAGCACGTGTGGTTCCCACTTCATCCTCATCCTCTTCTTCAGCACAGTCC
TGCTGGTTCTGGTCATCACTAACCTGGCCAGGAAGAGAATTCCTCCAGATGTCCCCATCCT
GCTCAACATCCTGCACCACCTCATTCCCCCAGCTCTGAACCCCATTGTTTATGGTGTGAGA
ACCAAGGAGATCAAGCAGGGAATCCAAAACCTGCTGAAGAGGTTGTAA (SEQ ID NO: 88)

AOLFR47 sequences: 

MSASNITLTHPTAFLLVGIPGLEHLHIWISIPFCLAYTLALLGNCTLLLIIQADAALHEPMYLFLA
MLAAIDLVLSSSALPKMLAIFWFRDREINFFACLAQMFFLHSFSIMESAVLLAMAFDRYVAICK
45 PLHYTK\O.TGSLITKIGMAAVARAVTLMTPLPIT.LRCFHYCRGPVIAHCYCEHMAVVRLACGD 
TSFNNIYGIAVANflTV\^DLLLVILSYIHLQAVLLL^ 

SVMHRVARHAAPHVHILLANFYLLFPPMVTsfPir/GWTKQIRESILGVFPRK^M fSEO ID NO" 
89) 

ATGTCAGCCTCCAATATCACCTTAACACATCCAACTGCCTTCTTGTTGGTGGGGATTCCAG
GCCTGGAACACCTGCACATCTGGATCTCCATCCCTTTCTGCTTAGCATATACACTGGCCCTG
CTTGGAAACTGCACTCTCCTTCTCATCATCCAGGCTGATGCAGCCCTCCATGAACCCATGT
ACCTCTTTCTGGCCATGTTGGCAGCCATCGACCTGGTCCTTTCCTCCTCAGCACTGCCCAAA
ATGCTTGCCATATTCTGGTTCAGGGATCGGGAGATAAACTTCTTTGCCTGTCTGGCCCAGA
TGTTCTTCCTTCACTCCTTCTCCATCATGGAGTCAGCAGTGCTGCTGGCCATGGCCTTTGAC
CGCTATGTGGCTATCTGCAAGCCACTGCACTACACCAAGGTCCTGACTGGGTCCCTCATCA






3CCCGGGCTGTGACACTAATGACTCO 

rrGCCG 

TGTGG< 

C^G™TTGTGGTGgG^ 



GTATTCCCAAGAAAGGATATCTAG (SEQ ID NO: 90) 






1 5 (SBQ H> NO: 



VT- 
HI 
91) 

ATGATGGTGGATCCCAATGGCAATGA 
20 CTGGTTTAGAAGAGGCTCAGTTCTCOT^^ 
GTGCTAGGTAAOTGACAAT^^^ 
TGTATATATTTCTTTGCATGCTrrcAGGCATTGA^^ 

CATCATCTCCGCCATTGGCCT^ 

^^C^CTGGCCACACACCCrrCAGAGCCCTAG (SEQ ID NO: 92) 

NO: 93) 
ATGCTCACT 

rrrCATAATGTCTGCTCAGTACCCAGCTCC^CTGGCrCAC^ 

55 SSSSSSS^Uo»"»» 
ATTTGGTCAGTGTGTGCCTCCTCCAGTCCACACTCTGCT

CTCCAATCCTCAATCCCATTGTCTATGCTGTTCGCACCAAGCAGATCCGAGAGAGCCTTCT 
CCAAATACCAAGGATAGAAATGAAGATTAGATGA (SEQ ID NO: 94) 

AOLFR50 sequences: 

MNLDSFFSFLLKSLIMAI^SSWlUJQPSFFLVGIPGLEESQHWIALPLGILYLLALVGmrriLFn 
WMDPSlJiQSMYLFLSMIAAIDLYVASSTAPKAI^VLLyRAQEIGYTVCIJOMFFTT^ 
GVLVAMALDRYVAlCHPLHHSmHPGVlGHIGMVVLVRGLLLLIPFLILLRKLIFCOATnGHAY 
CEHMAVVKLACSF^VMlAYGLTVALLWGLDVLAIGVSYAjnLQAVLKWGNEARLKAF^T 
( ^^^^ YIPGMFSFLTHRFGHH VPHHVHVLLAJLYRLVPPALNPLVYRVKTQKIHQ 

ATGAATTTGGATTCTTTTTTCTCITTCCTCCTCAAGTCATTGA 

CAGCTGGAGGCTACCCCAGCCTTCTTTTTTCCTGGTAGGAATTCCGGGTTTAGAGGAAAGC

CAGCACTGGATCGCACTGCCCCTGGGCATCCTTTACCTCCTTGCTCTAGTGGGCAATGTTA 

CCATTCTCTTCATCATCTGGATGGACCCATCCTTGCACCAATCTATGTACCTCTTCCTGTCC

ATGCTAGCTGCCATCGACCTGGTTGTGGCCTCCTCCACTGCACCCAAAGCCCTTGCAGTGC 

TCCTGGTTCGTGCCCAAGAGATTGGTTACACTGTCTGCCTGATCCAGATGTTCTTCACCCAT 

GCATTCTCCTCCATGGAGTCAGGGGTACTTGTGGCCATGGCTCTGGATCGCTATGTAGCCA 

TTTGTCACCCCTTGCACCATTCCACAATCCTGCATCCAGGGGTCATAGGGCACATCGGAAT 

GGTGGTGCTGGTGCGGGGATTACTACTCCTCATCCCCTTCCTCATTCTGTTGCGAAAACTT 

ATCTTCTGCCAAGCCACCATCATAGGCCATGCCTATTGTGAACATATGGCTGTTGTGAAAC 

TTGCCTGCTCAGAAACCACAGTCAATCGAGCTTATGGGCTGACTGTGGCCTTGCTTGTGGT 

TGGGCTGGATGTCCTGGCCATTGGTGTTTCCTATGCCCACATTCTCCAGGCAGTGCTGAAG 

GTACCAGGAAATGAGGCCCGACTTAAGGCCTTTAGCACATGTGGCTCTCATGTTTGTGTCA 

TCCTGGTCTTCTATATCCCGGGAATGTTCTCCTTCCTCACTCACCGCTTTGGTCATCATGTA 

CCCCATCACGTCCATGTTCTTCTGGCCATACTGTATCGCCTTGTGCCACCTGCACTCAATCC 

TCTTGTCTATAGGGTGAAGACCCAGAAGATCCACCAGTGA (SEQ ID NO: 96) 

AOLFR51 sequences: 

M CQQILRDCILLIHHLCINPJCKVSLVMLGPAYNHTMETPASFLLVGIPGLQSSHLWLMSI^ 

YUALLGNTUVTAIWMDSTRHEPMYCFLCVLAAVDIVMASSVVPKMVSIFCSGDSSISFSACFTO 

^^ TA ^ GLLLT ^DRYVAICI0PU1YKRIL 

HLPFCGS>JVVVHSYCEHIAI^PXACADPVPSSLYSLIGSSLMVGSDVAFIAASYILILKAVFGLSS 

KTAQLKAI^TCGSHVGVMALYYLPGMASry-AAWLGQDVWLHTQVLLADLYVnPATLNPIIY 
GMRTKQLRERIWSYLMHVLFDHSNLGS (SEQ ID NO: 97) 

ATGTGTCAACAAATCTTACGGGATTGCATTCTTCTCATACATCATTTGTGCATTAACAGGA 

AAAAAGTCTCACTTGTGATGCTGGGTCCAGCTTATAACCACACAATGGAAACCCCTGCCTC 

CTTCCTCCTTGTGGGTATCCCAGGACTGCAATCTTCACATCTTTGGCTGGCTATCTCACTGA

GTGCCATGTACATCATAGCCCTGTTAGGAAACACCATCATCGTGACTGCAATCTGGATGGA 

TTCCACTCGGCATGAGCCCATGTATTGCTTTCTGTGTGTTCTGGCTGCTGTGGACATTGTTA

TGGCCTCCTCGGTGGTACCCAAGATGGTGAGCATCTTCTGCTCAGGAGACAGCTCAATCAG 

CTTTAGTGCTTGTTTCACTCAGATGTTTTTTGTCCACTTAGCCACAGCTGTGGAGACGGGG

CTGCTGCTGACCATGGCTTTTGACCGCTATGTAGCCATCTGCAAGCCTCTACACTACAAGA

GAATTCTCACGCCTCAAGTGATGCTGGGAATGAGTATGGCCATCACCATCAGAGCTATCAT 

AGCCATAACTCCACTGAGTTGGATGGTGAGTCATCTACCTTTCTGTGGCTCCAATGTGGTT 

GTCCACTCCTACTGTGAGCACATAGCTTTGGCCAGGTTAGCATGTGCTGACCCCGTGCCCA

GCAGTCTCTACAGTCTGATTGGTTCCTCTCTTATGGTGGGCTCTGATGTGGCCTTCATTGCT 

GCCTCCTATATCTTAATTCTCAAGGCAGTATTTGGTCTCTCCTCAAAGACTGCTCAGTTGAA 

AGCATTAAGCACATGTGGCTCCCATGTGGGGGTTATGGCTTTGTACTATCTACCTGGGATG 

GCATCCATCTATGCGGCCTGGTTGGGGCAGGATGTAGTGCCCTTGCACACCCAAGTCCTGC

TAGCTGACCTGTACGTGATCATCCCAGCCACCTTAAATCCCATCATCTATGGCATGAGGAC 

CAAACAACTGCGGGAGAGAATATGGAGTTATCTGATGCATGTCCTCTTTGACCATTCCAAC
CTGGGTTCATGA (SEQ ID NO: 98)
AOLFR52 sequences:

MLGPAYNHTMETPASFLLVGIPGLQSSHLWLAISLSAMYITALLGNTLIVTAIWMDST^EP^ 

cf^cv^a^ 

S T ACADPVPS^YSLJGSSLMVGSDVAFIAASYIL^ 
Y^G^^AA^S^ 
SNLGS (SEQ ID NO: 99) 

ATGCTGGGTCCAGCTTACAACCACACAATGGAAACCCCTGCCTCCTTCCTCCTTGTGGGTA
1 ft Trr^GGACTGCAATCTTCACATCTTTGGCTGGCTATCT 

10 Igco^S™ggCc^ctcatcgtga CT ^ 

CCCATGTATTGCTTTCTGTGTGTTCTGGCTGCTGTGGA^ 
ArScGA^GAT^^ 

rrA^A^AGCTTTG^CC^GGTTAGCATGTGCTGACCCCGTGCCCAGC 

20 toSaSgcIot^gatctctcct 

CGCT^CCATGTGGGGGTTATGGC^ 
CCTOG^GGGGCAGGATATAGTC 

g^caTccSg^c^^^ 
ggWtgg^ 

ID NO: 100)



AOLFR54 sequences: 

MQFXiMT qnNffl pdtffltgipgleaahfwiaipfcamylvalvgnaalilvtamdnaxhapmy 
Sltopjghhewkhvhiflanlyvlvppvlnpilygartkeip^ (SEQ ID 

NO: 101) 

SSg^aS^ 

atotacctctoctctgccttctctcactcacagacctggctct 
caagatgctgg^xattttgtggctcca 

40 CA^ATGTTT^GT^TCCATTCTATCTATGCTCTGGAGTCCTCGATTCT 
TGATAGGTATGTGGCTATCTGTA^ 

A^AG^CAGAAirGGCTTTGTTGGGCTATTCCGTAGTGTGG 







TCTAAAACTGCTTCACCTGGGGAAGACTTCAATATGA (SEQ ID NO: 102)

AOLFR57 sequences:
ATGTCATTTCAGGTGACTTATATGTTCTATCTACACTGGACCATGGAAAAAAGCAATAATA
^ A £™ mA " CT OTGGGGTm^ 

TTATTTTTGTTTTGCTACATTGCTATrTGGATGGGAAAOT 

GTGCACCCAGCTCATTCACCAACCCATGTATTTCTTCCTCAATTACCTCT 

TIT^CTACACATCCACAGTGACCCCCAAATTAATGGTTGACITACrGGCAGAAAGAAAG 

CATTTCCTATAATAACTGTATGATACAACTCITTACCACCCATTTTTTTGGAGGCA^^ 

TCTTCATTCTCACAGGGATGGCCTATGACCGCTATGTGGCCATTT^ 

CACCATTATTATGAGCAGGCAAAAGTGTAACACAATCATCATAGTTTGTTGTACTGGGGGA 
T^ATACATTCTGCCAGTCAGTTTCTTCTCACCATCTTTC 

GATAGATCACTACTTCTGTGATGTGTATCCTTTGCTGAAATTGGCCTGTTCTAATA^ACAOV 
TCATAGGTCTCTTAGTCATTGCTAATTCAGGCTT^^ 

TTGTTGTCTTATGTTTTTATATTGTATACCATCAGAGCATACTCTGCAGAGAGACGCAGCA 
AAGCTCTTGCCACTTGTAGTTCTCATGTAATTGTTGTGGTCCTGTTTTTTGCTCCTGCATTG
TTCATTTACATTAGACCGGTCACAACATTCTCAGAAGATAAAGTGTTTGCCCTTTTTTATAP 
CATCATTGCTCCCATGTTCAACCCTCTCATATACACGCTGAGAAACACAGAGATGAAG^C 

ID NO: 104)

AOLFR58 sequences:
^*™ALNNFALGC^ 

VLLGLSQNPNVQEIVFVWLFVYIATVGGNMLIVVTILSSPALLVSPMYFFLGFI^FLIMCFSSVT 
^IVDSLYVTKTISFEGCMMQLFAEHFFAGVEVIVLTAMAYDRYVAICKPLHYSSIMN^ 
CGILMGVAWTGGLLHSMIQILFTFQLPFCGPNVINHFMCDLYPLLEl^CTDTHIFGLMVVINSG 
HC1INFSLLLVSYAVILLSLRTHSSEGRWKALSTCGSHIAVVILFFVPCIFVYTRPPSAFSLDKMA 
AIFYIILNPLLOTLrYTFRNKEVKQAMRimVNRLMVVSDEKENIKL (SEQ ID NO: 105) 

TGACTATGATACCACAAATTGATCTGAAGCAAATTTTCCTTTGTCCTAATTGCAGACTATA 

AC Z^^ GTCCTCCTGGGAC ^ 

TATTTTTGTTTGTCTACATTGCAACTGTTGGGGGCAACATGCTAATTGTAGTAACCATTCTC 
AGCAGCCCTGCTCTTCTGGTGTCTCCTATGTACTTCTTCTTGGGCTTCCTGTCCTrCCTGGA 

tgcgtgcttctcatctgtcatcaccccaaagatgattgtagactccctctatgtSacaa^^ 

ACCATCTCTTTTGAAGGCTGCATGATGCAGCHX^ 

AGGTGATTGTCCTCACAGCCATGGCCTATGATCGTTATGTGGCCATITGCAAGCCCTTGCA 

TTACTCTTCTATCATGAACAGGAGGCTCTGTGGCATTCTGATGGGGGTAGCCTGGACAGGG 

GGCCTCTTGCATTCCATGATACAAATTCITTTTACrTTCCAGCTTCCCT 

T GTGA I CAATCAcmATGTGTGAC ^ m 

ACATCTTTGGCCTCATGGTGGTCATCAACAGTGGGTTTATCTGCATCATAAACTTCTCCTTG 

JT^™™ CTATGCTGTCATC ^ GCTCT 

GAAAGCTCTCTCCACCTGTGGATCTCACATTGCrGTTGTGATTTTGTTCTTTGTCCCATGCA 

A^C^A^sIqI^ 
AOLFR59 sequences: 

MGDWNNSDAVEPIFIlJlGFPGLEYVHSWl^ILFCLAYLVAFMGNVTIl^VIWIESSi^ 

^^!^™ S ^ GMG ^ CLL ^ L ^ 
RTNSrTCLCVVIATLGVDSIFILI^YVLILNrVLDMS^ 

MVHRFGKM^PIVHILMADIYLLLPPVLN^ (SEQ ID NO-
TTTGTCCTAAGGAGGAGGTTTTAA (SEQ ID NO: 108)






AOLFR60 sequences:

FG^RYffl™ANLYVVWPMLNPVIYGVR^ 
(SEQ ID NO: 109)

TCTA?^C^GC^

ACAG^GGO^^ 

TrrrrrTn a TTfiGTrTTGGTGTGTTTGTG AGGGCTTTAATTTTCGTCATTCCCT AlALii^ 
™a™cSSScOTCTGTGGGAATCATG^

g^aa^ctot^^^ 
?SS™§a^aacaaggaatggaaaa 

(SEQ ID NO: 110)

^SPL^LMANVLLLVPPLMKPIVYCVKTKQIRVRVVAKLCQWKI (SEQ ID NO: 111)



SSSSSSS^
JTGTC

nrcc( 

TOTCATGAAGTTGGCCrGTTCr^ 



AOLFR62 sequences: 

HRFGHQIPGYIHILVAhn^YLIIPPSLNPUYGVRTKQIR^VLYVFIX^ 

ATGTTTTATCACAACAAGAGCATATTTCACCCAGTCACATTTITCCTCATTGGAATrrr 
^ GGAAGAOTCCACATG ^^ 

tgggcaatgccaccattctgctagtcatcaaggtagaacag^ctctdc^ag2c 



A^.X^r. r ,WUUJJ i vjy\ i i ^AlJUAuA JTAACTATGGAGCTTGTGTGGCCCACtAT 

gtttctgatccatgccttcactggcatggaggctgaggtcttactggctatgg 
cgttatgtggccatctgtgctccactacattacg^ 



rTGTCAGGCTCACATAATAGCCCATTCCTACTGTGAGCACATGGG 



TCTACCGCCTACCCTT 
CA II G ^ AAA ^ GTCCTGTGG ^CATTC 
CTTTCTITGTTCrGAACCTGGTGCTCATTGGCATCTCGTATGTTTACATT 
™ GG 5°^ CCA ™^^^ 

GAGTCATC^GTGTTTTCTATATCCCITCAGTCrTCTCTTTCCTTACTCATrc 
^ TACCAGG ™ CA " CA ^^ 

AOLFR63 sequences: 
AJ^DLGl^LSSLPTVLSIFLFNAPEIS^ 

ILTTWVAQIGATSFKSMLLVLPFPFTLPJ^RYCKKNQLSHSYCLHQ 

gffgalclmvdfiliavsytlilktvlgiaskkeqlka£ntcvshic^w 
hvsplinvlmanvlllvppltnpr^c^ 



cttggttgggatgccag 



GGCTAGAATATGCACACATCTGGATCTCTATCCCCA^ 

GTCTTAAGCATCTTCCTGTTCAATGCTCCrGAAATTTCATCCAATGCCTGCTrrGCCCAGGA 

CCAAATAGGGATAGTATTCTCCTTTAAGAGCATGCTCCTGGTTCrrCCCITCCCrTTrA(^ 

CTCTGCCTTATGGTAGACTITATTCTCATTGCTGTGTCTTACACCCTGATCCTCAAGACTGT 
TGTGCAGTGATCATCTTCTACCTGCCCATCATCAACCTGGCCGTTGTCCACCGCTTTGCCCG
GTGTCAACGGAAGATTTAA (SEQ ID NO: 116)







35 ^SISSS^vS^IYGWTXQIBERVlMFlXTOHCSEQlDNO:!!!.) 



CATGGCCTTTGACCGCTACATTGCCATTT( 




^^^A T ATTr,r,TnTTATCITAGCCTTTTTTACACCAGCATTTT^ 



GCAAAATCATCAGCCTCATTGGAGGCATrGCTGTCCTGAGGAG^ 

actggtgtttctccttctg 
gtgagcacatgggcattg< 
ccttggcaacatatctctc 
tcctgtatgctgtcttctg 

tocaSagc^

TGCTGAGGATTTTTCTCAAGACCAATCACTAA (SEQ ID NO: 120)
TFI^GGHTlPLHmilMA^ ( ^q

^^^ mCT ^ TGGCACCAGCCT AACTCCAGCTTCATTCATCCTAAATGGCATCCCTG 
GTTTGGAAGATGTGCATTTGTGGATCTCCITCCCACTGTGTACCATGTACAGCATTGCTATT 
ACAGGGAACTTCGGCCTTATGTACCTCATCTACTGTGATGAGGCC^ACA^ 
ATGTCTTCCTTGCCCTrCITTCCTTCACAGATGTGCTCATGTGCACCAGCACOT 

gtt^gtgcacaccttcacagggatggagtctggggtgct^^^ 

GACTGTGTGGCCATCTGCTTCCCTCTGCGTTATGCCACCATCCTCACT^ 
TAAAGCTGGGTTCCTCACTTTTCTTAGGGGTGTGATGCT^ 

CCAAGCGCCTTCCATACTGCAAGGGCAACGTCATACCCCACACCTACTGTGACCACATCTC 
TGTGGCCAAGATATCTTGTGGTAATGTCAGGGTTAACGCC 

ctgctgattgggggctttgatatcctgtgcattaca^^ 

^GTGAGTCTATCATCAGCAGATGCTCGACAG^^ 

TTCTGTGCCATAGTCCTCACCTATGTTCCAGCCTTCITTACCTTCITTACACA^ 
GGGACACACCAITCCTCTACACATACATATTArrATGGCTAATCTCTAC^ 

CCACAATGAACCCTATTGTGTATGGGGTGAAAACCAGGCAGGTACGAGAAAGTCTCA^ 
GGTTCTTTCTTAAGGGAAAGGACAATTCTCATAACTTTTAA (SEQ ID NO: 122)



AOLFR67 sequences: 

9* ^gdnsssl^ 

L^FTDVTLCTTMVPNMLCIFWNLKEroFNACIAQMFFVHMLTGMESGVLMLMALDR^
CYPLRYATILTWVIAKAGLATFLRNVMLIIPFTLLTOP 

FK^IYGLMVALLIGWDICCISVSYTMLQAVMSLSSADAPJKAFSTCTSH^^ 

, , ?S^ VG ^™^^ 
(SEQ ID NO: 123)


ATGTCTGGGGACAACAGCTCCAGCCTGACCCCAGGATTCirrATCTTGAATGGCGTTCC^^ 

ggctggaagccacacacatctggatctccctgccattctgc^^ 

ACTACTTCCTGGCCCTGCTCTCCTTCACTGATGTCACCTTGTGCACCACCATGG'^ 

TGTTTTTTGTCCATATGCTGACAGGGATGGAGTCTGGGGTGCTCATGCTCATGGCCCTGGA 
CCAAGGCTCX5TCTTCX;CACCTTCTTGA^ 

ACCAAGCGCCTGCCCTATTGCCGGGGGAACrrCATCCCCCACACCTACTG^ 

TCTCCTGATTGGTGTGTTTGATATCTGCTGTATCTCTGTATCTTACACTATGATT^GCA 
CTGTTATGAGCCTGTCATCAGCAGATGCTCGTCACAAAGCCTTCAGCACCTG^CACATCTCA 

AGGACACAATATCCCAAACCACATACACATCATCGTGGCCAACCTTTATCTGCTACTGCCT 
CC i^J G ^ CCCAA ™^ A TGGAGTCAAGACCAAGCAGATTCAGGAAGGTGTAATTA 
AATTTTTACnTGGAGACAAGGTTAGTTTTACCTATGACAAATGA (SEQ ID NO: 124)



AOLFR68 sequences: 
NrTTORNDTIiJTEASDI^^ 

™]?^ YQFAGG ™ LGSDL ^^^ 
L^raVAKKKVSPDWVIJL^^ 



ATGACAACACACCGAAATGACACCCTCTCCACTGAAGCTTCAGAClT^^^TG^^^roT^
TrGTCAGATCCCCCAGCTGGCAGCACTGGCTOTC^ 



5 CCCAAGGTCCTGACCATCTTCTCjO 1 1 1 UALt l » ^ a o^-r. a xr-rTr a Tnnrr 

rrAGATGTACATCATGAATTGTTTCCTAGCCATGGAGTCTTGCACATTCATGGTCATGGCC 

?atg^tcg™gtagccatctgcc^^^ 
?otag?cSgg^ 

rp^f^f^SSS^^^^^^G<^TCTGACCrcATCCTTATCTTCCrCTCCTACACClTCATTCT 
CTCCCACTTCATGCTCATCCTCTTCTrCAGCACCA^rcCTTCTGGTrTrrG^^ 

CAGAGGTTGTTGAAGAAAGGGTGCTAA (SEQ ID NO: 126) 
AOLFR69 sequences:

IYSLRNKDVKDSMKKMFGKNQVINKVYFHTKK (SEQ ID NO: 127) 






r^TA^TTTTGGAACCACM 

„ ^I^^atcSgctggcattitacatgctactatacatacagtggct^^ 

^c^ctggaScaatgaW 
t^ctta^ttc^g^cactcacacaaac 

, n ^^Tatt?at?atgggacaatcctcttcatgtatgtgagaccaag 

r^^CA^AGTG^^ 

GGTTATCAATAAAGTATATTTTCATACTAAAAAATAA (SEQ ID NO: 128) 
AOLFR70 sequences:

50 kgr|^ 

KAVKKMLKLVYK (SEQ ID NO: 129) 
TrTTTATA^tTGAO
A ^ C J GGCTCCACAACCCCATGTA ™T^

TATTCTACAGTTGTCACTCCAAAAATGTTGGTCAATTTCCTGGCAAAAAATAAATCCATTT 

^IE A ^ CGGATGTGCAACA ^^ 

<; pIptmt^ ^ C AATG G CTTATG ATC A CTATGT A G CCATCTA CAACCCTCTCCTGTATTCA GT 
GAGCATGTCACCCAGAGTCTATGTGCCACTCATCACTGCTrTCCTACGTTGCTGGCATTTTAC
ATGCTACTATACATATAGTGGCTACATTTAGCCTGTCCITCTGTGGATCCAATGAAA^AG 

AGCTTCTACTCTTCTACTTTGTGGGTTCTATTGAGATAGTCACTATCCTGATTGTCCTCATT 
, n IS™ ^ CA ^ Q ^ CCA ^GAAGATGCATTCn:GGrAAGGGAAGGCAA^ 
0 ^^ CTACATGTGGCTCTCACCTAACTGGAGTGACAA ™ TG ATGGAACAA 

ACACAATTGTGATTCCCAAGTTGAATCCCATCATCTATAGTTTGAGGAACAAAGAAGTAAA 
AAAGGCAGTGAAGAAAATGTTGAAATTGGTTTACAAATGA (SEQ ID NO: 130) 

AOLFR71 sequences:

MGWWNTNVPDFILTGLSDSEEVQMALFE.FLLIYLITMLGNVGMILIIRLDLQLHTPMYFF^ 
I^FIDI^YSTVITPKTI^NLLTSNYISFMGCFAQMFFFWLGAAECFLI^SMAWRYVAlS^ 
PVIMSKi^CCALVTGPYVISFmSFVN\^MSRLHFCDSNV\TmFFCDTSPILALSCMDTYDTF.T 
on ^LAGSTLMVSLITISASYVSILSTILKmSTSGKQKA^TCASHLLGVTIFYGTMIFTYLKPP^ 
SYSLGRDQVASVFYTIVIPMLNPLIYSLRNKEVKNALIRVMQRRQDSR (SEQ ID NO: 131)

ATGGGTAGAAGAAATAACACAAATGTGCCTGACTTCATCCTTACGGGACTGTCAGATTCTG 
AAGAGGTCCAGATGGCCCTCTTTATACTATTTCTCCTGATATACCTAATTACTATGCT^ 
o < A ^I G J GGGGATGATA " GATAATG CGCCTGGACCTCCAGCTTCACACTCCCATC^ 
TCCTTACTCACTTGTCATTTATTGACCTCAGTTACTCAACTGTCATCACACCTAAAACC^A
GCGAACTTACTGACTTCCAACTATATTTCCTTCATGGGCTGCTTTGCCCAGATGTTCTTT^ 
TOTCTTCTTGGGAGCTGCTGAATGTTTTCTTCTCTCATCAATGGCCTATGATCGCTACGT^ 
CTATCTGCAGTCCTCTACGTTACCCAGTTATTATGTCCAAAAGGCTGTGTTGCGCTrcrTGTC 
^^^GGGCCCTATGTGATTAGCTTTATCAACTCCTTTGTCAATGTGGTTTGGATGAGCAGAC 
TCCATITCTGCGACTCAAATGTAGTTCGTCACrTrrrCTGCGACACGTCr^^
CTGTCCTGCATGGACACATACGACATTGAAA^^ 

TGATGGTGTCCCTTATCACAATATCTGCATCCTATGTGTCCATTCTCTCTACCATCCT^ 
AJTAATTCCACTTCAGGAAAGCAGAAAGCTrTGTCTACTC^ 

TCACCATCTTTTATGGAACTATGATTTTTACTTATTTAAAACCAAGAAAGTCTTATTCTTTG 
GGAAGGGATCAAGTGGCTTCTGTTrTTTATACTATTGTGATTCCCATGCTGAATCCACTCAT 

AOLFR72 sequences: 

MAPENFITlVTEFn^TGVSSCPELQIPlJT^VFLVLYGLTMAGNLGIITLTSV^ 
ALINLGNSTVTAPKMLINFLVKKXTTSFYECATQLGGFLFFrVSEVTMLALMACDR 
YMVWSRIU.CLLLVSLTYLYGFSTAIWSSYWSVSYCSSMINHFYCDNVPLLALSCSDTYLPE 
^i^^ GSL ^ VS ™^ 
45 f 3 S 3 ^ S ^ TOD ^ SV1 ^^^^ (SEQ ID NO: 

ATGGCTCCTGAAAATTTCACCAGGGTCACTGAGTTTA1TCTTACAGGTGTCTCTAGCTGTC 
CAGAGCTCCAGATTCCCCTCTTCCTGGTCTTTCTGGTGCTCTATGGGCTGACCATGGCAGG 
GAAG CTGGGCATCATCACCCTCACCAGTGTTGACTCrCGACTTCAAACCCCCATGTAC^ 
TOTGCAACATCTGGCTCTCATTAATCTTGGTAACTCTACT

GG TI C J7 G ^ CmA " GTATCGGAGGT AATCATGCTGGCTTTGATGGCCT^ 
GTGGCTATTTGTAACCCTCTGCTGTACATGGTGGTGGTGTCTCGGCGGCTCTGCCTCCTGCT 
GGTCTCCCTCACATACCTCTATGGCTTTTCTACAGCTATTGTGGTTTCATCTTATGTATTCT 
CTGTGTCITATTGCTCTTCTAATATAATCAATCATTTTTACTGTGATAATGTTCCTCTGTTA
GCATTATCTTGCTCTGATACTTACTTACCAGAAACAGTTGTCTTTATATCTGCAGCAACAA 



ATGTCK3TTCK3TTCCITGATTATAGTTCT

AAAATATGTTCATCAGAAGGAAGGAAAAAAGCCTTITCTACCTGTGCTTCACATATC 
CAGTCACAAm^ATGGGACATTGCTATTCATGTATGTGCAGCCC^ 

a^ggatact^^ 

AATCTGTGCTATTCCTTTAAAACAATGTAA (SEQ ID NO: 134) 
AOLFR73 sequences: 

Q^SsHTLAID^Svi ^LLIPMLNPLIYSLRNKEVKDALKRTLTNRFKIPI (SEQ ID NO: 135) 

ATGAATCATGTGGTAAAACACAATCACACGG
TCGGGATTACAGACAACCCTGGG 

===^^ 

9 S T^rrr, AT {^CTCAGGCTGTAATrTGCTCTTCTCCCTCTCAATTGTTCTCATATCCTACATGTT 
^CTA^ScCA^CAGAATGAACTCAAGGAAA 

tgSgctctcatctgacagtggtgatcatgttctatgggacattgttatttat^ 
IcccaagtccagYc^^ 

AGAGAACTTTAACCAATCGATTCAAAATTCCCATTTAA (SEQ ID NO: 136)
AOLFR74 sequences: 

SFDTDKVASIFYTLV1PMLNPLIYSLIWKDVKYALRRTWNNLCNIFV (SEQ ID NO: 137) 
ATGGAACAACACAATCTAACAACGGTGAATGAATTCATTCTTACGGGAATCACAGATATC 

40 ===== 

====== 

ATC^cSfcTGTAACCCT^

TTACTTTATCCTTCTGTGGCTACAACGTCATTAGTCATTTCTACTGTGACAGTCT 

^accittgctttgttcaaatacacatgaaattc 

^ATTOAmCATCTCTTCTGATAGTrCTm 

SggYtg^ctgctggcagaca^

CTCATAGTC^CTATGGGACTITGCTITrCATGTA^^ 

TCACACTGATAAAGTGGCTTCCATATT^ 
TOA^AGmAcbAAACAAAGATGTAAAATATGCCCTAC 

TAATATTTTTGTTTAA (SEQ ID NO: 138) 

AOLFR75 sequences:

^lv^lpsatvpkmllmqtqtqtisypgclaqm^ 
chplhystimalpxcaslvaapwviailwllhtlmmahlhfcs 

TSLNQLSVLATVGLIFVVPSVCILVSYILIVSAVMKVPSAQGKLKAFSTCGS^ 
GVYMSPI^NHSTEKDSAASVIFMVVAPVLNPFIYSLRNNELKGTT.K^ 

GGRGGWIMRSGDRDHPG (SEQ ID NO: 139) ^^OA VAHACNPSTL 

ATGGAAGGGAAAAATCAAACCAATATCTCTGAATTTCTCCTCCTGGGCrrCTCAAGTTGRr 

^^^ GCAGGTGCTACTmGCACmCCTGTGTCTCTA ^AA 

AACTTACTCATCTTGCTGGCCATTGGCTCGGATCACTGCCTTCACACACCCATC 

^AACATCCAAACCCAAA^^ 

ctgtatgatgtttgccaatatggacaattttot^ 

G I2 GGCATCTGTCACCCmACA ^CTCCACCATTATC^^ 

GG ™ G ^ GCACCTOGG ™^ 

ATCTGCACTTCTGCTCTGATAATGTTATCCACCATTrCTTCTGTGATATCAACTC^^ 
GC J^ GTCCTG ^ CCGACAGGA GTCTTAATCAGm^ 

^GT^OTTCTGCCCAAGGAAAACTCAAGG<^CT(^ACCTGTGGATCTCACCn:GC^ 

GG 1S A YS^£* TGGAGCAAA ^ 

AC T G ^ AAAAGACTCAGCCGC ^^ 

TCATTTACAGTTTAAGAAACAATGAACTGAAGGGGACTTTAAAAAAGACCCTAAGCCGGC 
CGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTrTGGGAGGCCGAGGCGGCTG^ 
TGAGGTCAGGAGATCGAGACCATCCTGGCTAA (SEQ ID NO: 140) uuuouol ^ A1 CA 




ATGGAGAATAATACAGAGGTGAGTGAATTCATCCTGCTrGGTCTAACCAATGCCCCAGAA
9JACAGGTTCCCCTCTTTATCATGTTTACCCTCATCTACCTC 
GGGGATGATCATAITAATCCTGCTGGACTCT^^ 

G ^°^ GTCTC " GCAG GCATTGGTTACTCCTCAGCTGTCACTCCAAAGG^ 
OTGCTTATAGAAGACAAAGCCATCTCCTACAGTGCCTGTGCTGCTCAGATGTrCTTTTGT 

CAGTG I GTAACCCCCTAC ^^ 

TA TAGGCTGTTATGTCATTGGTTTTCTGAATGCTTCTATC 

^^^^ 
CTGACCTGCTCTG AGy^ 

GCACACAGGTAAGGGATACCAGAAGCCTTTATCTACCTGTGGTTCTCACCTCATTGCCATT 
^C^ATTrrATATAACTGTCATCATCATGTACATACGACCAAGTTCCAGTCATrCCATGGA 
.t CA GACAAAA ™ CATCTGTOTCT ACACTATGATCATCCCCATGCTCAGTCCTATAGTCT 

A ^^H gaggaacaaagacgtgaagaatgca ™atgaaggttgttgagaaggcaa^ 

ATTCTCTAGATTCAGTCTTTTAA (SEQ ID NO: 142) 
AOLFR77 sequences: 

MGDVNQSVASDFILVGLFSHSGSRQLLFSLVAVMFVIGIXGNTV^ 

QI^LFDIGCPMVTIPiaviASDFLRGEGATSYGGGAAQIFFLTLMGVAEGVLLVIMSYDRYVA 

QPLQYPVLMRRQVCLLMMGSSWWGVLNASIQTSITUIFPYCASRIVDHFFCEVPALLKLSCA 

DTCAYEMAl^SGVLimiJLSLIATSYGHVLQAVI^MRSEEARHKAVTTCSSmTWGLFYGA
AVFMYMVPC
(SEQ ID NO: 143) 



:ayhspqqdnwsli^slvtptlotliyslb3^pevwmalvkvlsraglrqmc 


ATGGGGGATGTGAATCAGTCGGTGGCCTCAGACTTCATTCTGGTGGGCCTCITCACT 
r TrrATrACGCCAGCTCCTC^ 

ca °?^acgcc^ 



fif^rAGCCAGCTCTCCCTG'ITrGACATTGGCTGTCCCATGGTCACCAl CCUUVAUAIOULA 

?Sg12^ctcSgggagAggtgccacctcctatggaggtggtgcagct^ 
?cc?^cIcSatggg?g?g^ctgagggc 






AOLFR78 sequences:

KCLSEKAVK (SEQ ID NO: 145) 

« tS^A(^GCCATCTGTCATCCTCTG 
CTTTC^TGTGGCCrrcG^CTGCTGGG 

c A 5Sa^aS 
gagaaagcagtaaagtaa (SEQ ID NO: 146)

AOLFR79 sequences:



(SEQ ID NO: 147) 



^s^^ss^^^^s 1

SEQ ID NO: 315, SEQ ID NO: 317, SEQ ID NO: 319, SEQ ID NO: 321, SEQ ID
NO: 323, SEQ ID NO: 325, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, 
SEQ ID NO: 333, SEQ ID NO: 335, SEQ ID NO: 337, SEQ ID NO: 339, SEQ ID 
NO: 341, SEQ ID NO: 343, SEQ ID NO: 345, SEQ ID NO: 347, SEQ ID NO: 349, 
SEQ ID NO: 351, SEQ ID NO: 353, SEQ ID NO: 355, SEQ ID NO: 357, SEQ ID
NO: 359, SEQ ID NO: 361, SEQ ID NO: 363, SEQ ID NO: 365, SEQ ID NO: 367, 
SEQ ID NO: 369, SEQ ID NO: 371, SEQ ID NO: 373, SEQ ID NO: 375, SEQ ID 
NO: 377, SEQ ID NO: 379, SEQ ID NO: 381, SEQ ID NO: 383, SEQ ID NO: 385, 
SEQ ID NO: 387, SEQ ID NO: 389, SEQ ID NO: 391, SEQ ID NO: 393, SEQ ID 
NO: 395, SEQ ID NO: 397, SEQ ID NO: 399, SEQ ID NO: 401, SEQ ID NO: 403,
SEQ ID NO: 405, SEQ ID NO: 407, SEQ ID NO: 409, SEQ ID NO: 411, SEQ ID
NO: 413, SEQ ID NO: 415, SEQ ID NO: 417, SEQ ID NO: 419, SEQ ID NO: 421, 
SEQ ID NO: 423, SEQ ID NO: 425, SEQ ID NO: 427, SEQ ID NO: 429, SEQ ID 
NO: 431, SEQ ID NO: 433, SEQ ID NO: 435, SEQ ID NO: 437, SEQ ID NO: 439, 
SEQ ID NO: 441, SEQ ID NO: 443, SEQ ID NO: 445, SEQ ID NO: 447, SEQ ID
NO: 449, SEQ ID NO: 451, SEQ ID NO: 453, SEQ ID NO: 455, SEQ ID NO: 457, 
SEQ ID NO: 459, SEQ ID NO: 461, SEQ ID NO: 463, SEQ ID NO: 465, SEQ ID 
NO: 467, SEQ ID NO: 469, SEQ ID NO: 471, SEQ ID NO: 473, SEQ ID NO: 475, 
SEQ ID NO: 477, SEQ ID NO: 479, SEQ ID NO: 481, SEQ ID NO: 483, SEQ ID 
NO: 485, SEQ ID NO: 487, SEQ ID NO: 489, SEQ ID NO: 491, SEQ ID NO: 493,
SEQ ID NO: 495, SEQ ID NO: 497, SEQ ID NO: 499, SEQ ID NO: 501, SEQ ID 
NO: 503, SEQ ID NO: 505, SEQ ID NO: 507, SEQ ID NO: 509 and SEQ ID NO: 
511. 



21. The method of Claim 20, wherein the ligand is a primary odorant.

22. A compound which blocks activation of an olfactory receptor identified by 
Claim 20.

NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID
NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID 
NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 35, SEQ ID 
NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 43, SEQ ID NO: 45, SEQ ID 
NO: 47, SEQ ID NO: 49, SEQ ID NO: 51, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID
NO: 57, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 63, SEQ ID NO: 65, SEQ ID 
NO: 67, SEQ ID NO: 69, SEQ ID NO: 71, SEQ ID NO: 73, SEQ ID NO: 75, SEQ ID 
NO: 77, SEQ ID NO: 79, SEQ ID NO: 81, SEQ ID NO: 83, SEQ ID NO: 85, SEQ ID 
NO: 87, SEQ ID NO: 89, SEQ ID NO: 91, SEQ ID NO: 93, SEQ ID NO: 95, SEQ ID 
NO: 97, SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, SEQ
ID NO: 107, SEQ ID NO: 109, SEQ ID NO: 111, SEQ ID NO: 113, SEQ ID NO: 115, 
SEQ ID NO: 117, SEQ ID NO: 119, SEQ ID NO: 121, SEQ ID NO: 123, SEQ ID 
NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131, SEQ ID NO: 133, 
SEQ ID NO: 135, SEQ ID NO: 137, SEQ ID NO: 139, SEQ ID NO: 141, SEQ ID 
NO: 143, SEQ ID NO: 145, SEQ ID NO: 147, SEQ ID NO: 149, SEQ ID NO: 151,
SEQ ID NO: 153, SEQ ID NO: 155, SEQ ID NO: 157, SEQ ID NO: 159, SEQ ID 
NO: 161, SEQ ID NO: 163, SEQ ID NO: 165, SEQ ID NO: 167, SEQ ID NO: 169, 
SEQ ID NO: 171, SEQ ID NO: 173, SEQ ID NO: 175, SEQ ID NO: 177, SEQ ID 
NO: 179, SEQ ID NO: 181, SEQ ID NO: 183, SEQ ID NO: 185, SEQ ID NO: 187, 
SEQ ID NO: 189, SEQ ID NO: 191, SEQ ID NO: 193, SEQ ID NO: 195, SEQ ID
NO: 197, SEQ ID NO: 199, SEQ ID NO: 201, SEQ ID NO: 203, SEQ ID NO: 205, 
SEQ ID NO: 207, SEQ ID NO: 209, SEQ ID NO: 211, SEQ ID NO: 213, SEQ ID 
NO: 215, SEQ ID NO: 217, SEQ ID NO: 219, SEQ ID NO: 221, SEQ ID NO: 223, 
SEQ ID NO: 225, SEQ ID NO: 227, SEQ ID NO: 229, SEQ ID NO: 231, SEQ ID 
NO: 233, SEQ ID NO: 235, SEQ ID NO: 237, SEQ ID NO: 239, SEQ ID NO: 241,
SEQ ID NO: 243, SEQ ID NO: 245, SEQ ID NO: 247, SEQ ID NO: 249, SEQ ID 
NO: 251, SEQ ID NO: 253, SEQ ID NO: 255, SEQ ID NO: 257, SEQ ID NO: 259, 
SEQ ID NO: 261, SEQ ID NO: 263, SEQ ID NO: 265, SEQ ID NO: 267, SEQ ID
NO: 269, SEQ ID NO: 271, SEQ ID NO: 273, SEQ ID NO: 275, SEQ ID NO: 277, 
SEQ ID NO: 279, SEQ ID NO: 281, SEQ ID NO: 283, SEQ ID NO: 285, SEQ ID
NO: 287, SEQ ID NO: 289, SEQ ID NO: 291, SEQ ID NO: 293, SEQ ID NO: 295, 
SEQ ID NO: 297, SEQ ID NO: 299, SEQ ID NO: 301, SEQ ID NO: 303, SEQ ID 
NO: 305, SEQ ID NO: 307, SEQ ID NO: 309, SEQ ID NO: 311, SEQ ID NO: 313,

SEQ ID NO: 351, SEQ ID NO: 353, SEQ ID NO: 355, SEQ ID NO: 357, SEQ ID
NO: 359, SEQ ID NO: 361, SEQ ID NO: 363, SEQ ID NO: 365, SEQ ID NO: 367, 
SEQ ID NO: 369, SEQ ID NO: 371, SEQ ID NO: 373, SEQ ID NO: 375, SEQ ID
NO: 377, SEQ ID NO: 379, SEQ ID NO: 381, SEQ ID NO: 383, SEQ ID NO: 385, 
SEQ ID NO: 387, SEQ ID NO: 389, SEQ ID NO: 391, SEQ ID NO: 393, SEQ ID
NO: 395, SEQ ID NO: 397, SEQ ID NO: 399, SEQ ID NO: 401, SEQ ID NO: 403, 
SEQ ID NO: 405, SEQ ID NO: 407, SEQ ID NO: 409, SEQ ID NO: 411, SEQ ID
NO: 413, SEQ ID NO: 415, SEQ ID NO: 417, SEQ ID NO: 419, SEQ ID NO: 421, 
SEQ ID NO: 423, SEQ ID NO: 425, SEQ ID NO: 427, SEQ ID NO: 429, SEQ ID
NO: 431, SEQ ID NO: 433, SEQ ID NO: 435, SEQ ID NO: 437, SEQ ID NO: 439,
SEQ ID NO: 441, SEQ ID NO: 443, SEQ ID NO: 445, SEQ ID NO: 447, SEQ ID 
NO: 449, SEQ ID NO: 451, SEQ ID NO: 453, SEQ ID NO: 455, SEQ ID NO: 457, 
SEQ ID NO: 459, SEQ ID NO: 461, SEQ ID NO: 463, SEQ ID NO: 465, SEQ ID 
NO: 467, SEQ ID NO: 469, SEQ ID NO: 471, SEQ ID NO: 473, SEQ ID NO: 475,
SEQ ID NO: 477, SEQ ID NO: 479, SEQ ID NO: 481, SEQ ID NO: 483, SEQ ID
NO: 485, SEQ ID NO: 487, SEQ ID NO: 489, SEQ ID NO: 491, SEQ ID NO: 493, 
SEQ ID NO: 495, SEQ ID NO: 497, SEQ ID NO: 499, SEQ ID NO: 501, SEQ ID 
NO: 503, SEQ ID NO: 505, SEQ ID NO: 507, SEQ ID NO: 509 and SEQ ID NO: 
511. 

19. A primary odorant identified by Claim 18. 
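
The selection criterion of Claim 18, step (c), can be illustrated with a minimal Python sketch. It assumes that each receptor's measured value counts as "bound or activated" when it exceeds a preset limit, and that a candidate with no responding receptor is not treated as a primary odorant. The function name `is_primary_odorant` and the parameters `limit` and `max_fraction` are hypothetical and are not taken from the claims.

```python
from typing import Sequence


def is_primary_odorant(values: Sequence[float], limit: float,
                       max_fraction: float = 0.10) -> bool:
    """Apply the 'only one or less than 10%' test to n receptor measurements.

    values: one measured binding or activation value per receptor.
    limit: assumed preset limit above which a receptor counts as responding.
    """
    n = len(values)
    if n == 0:
        raise ValueError("at least one receptor measurement is required")
    hits = sum(1 for x in values if x > limit)
    if hits == 0:
        # Assumption: a candidate that activates nothing is not a ligand at all.
        return False
    return hits == 1 or hits / n < max_fraction
```

For example, with 50 receptors and 4 responders (8%) the candidate would qualify, while with 20 receptors and 2 responders (exactly 10%) it would not.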

20. A method of identifying a compound which blocks activation by an odorant of
at least one olfactory receptor comprising:

(a) producing a structurally-related candidate compound from a ligand of
the at least one olfactory receptor,

(b) measuring activation by the odorant of the at least one olfactory
receptor with the candidate compound, and

(c) identifying the candidate compound as a compound which blocks
activation if activation of the at least one olfactory receptor is reduced or
inhibited;

wherein at least one of the olfactory receptors has an amino acid sequence selected
from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID

NO: 47, SEQ ID NO: 49, SEQ ID NO: 51, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID
NO: 57, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 63, SEQ ID NO: 65, SEQ ID 
NO: 67, SEQ ID NO: 69, SEQ ID NO: 71, SEQ ID NO: 73, SEQ ID NO: 75, SEQ ID
NO: 77, SEQ ID NO: 79, SEQ ID NO: 81, SEQ ID NO: 83, SEQ ID NO: 85, SEQ ID 
NO: 87, SEQ ID NO: 89, SEQ ID NO: 91, SEQ ID NO: 93, SEQ ID NO: 95, SEQ ID
NO: 97, SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, SEQ 
ID NO: 107, SEQ ID NO: 109, SEQ ID NO: 111, SEQ ID NO: 113, SEQ ID NO: 115, 
SEQ ID NO: 117, SEQ ID NO: 119, SEQ ID NO: 121, SEQ ID NO: 123, SEQ ID
NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131, SEQ ID NO: 133,
SEQ ID NO: 135, SEQ ID NO: 137, SEQ ID NO: 139, SEQ ID NO: 141, SEQ ID
NO: 143, SEQ ID NO: 145, SEQ ID NO: 147, SEQ ID NO: 149, SEQ ID NO: 151, 
SEQ ID NO: 153, SEQ ID NO: 155, SEQ ID NO: 157, SEQ ID NO: 159, SEQ ID 
NO: 161, SEQ ID NO: 163, SEQ ID NO: 165, SEQ ID NO: 167, SEQ ID NO: 169, 
SEQ ID NO: 171, SEQ ID NO: 173, SEQ ID NO: 175, SEQ ID NO: 177, SEQ ID
NO: 179, SEQ ID NO: 181, SEQ ID NO: 183, SEQ ID NO: 185, SEQ ID NO: 187,
SEQ ID NO: 189, SEQ ID NO: 191, SEQ ID NO: 193, SEQ ID NO: 195, SEQ ID 
NO: 197, SEQ ID NO: 199, SEQ ID NO: 201, SEQ ID NO: 203, SEQ ID NO: 205, 
SEQ ID NO: 207, SEQ ID NO: 209, SEQ ID NO: 211, SEQ ID NO: 213, SEQ ID 
NO: 215, SEQ ID NO: 217, SEQ ID NO: 219, SEQ ID NO: 221, SEQ ID NO: 223,
SEQ ID NO: 225, SEQ ID NO: 227, SEQ ID NO: 229, SEQ ID NO: 231, SEQ ID
NO: 233, SEQ ID NO: 235, SEQ ID NO: 237, SEQ ID NO: 239, SEQ ID NO: 241, 
SEQ ID NO: 243, SEQ ID NO: 245, SEQ ID NO: 247, SEQ ID NO: 249, SEQ ID 
NO: 251, SEQ ID NO: 253, SEQ ID NO: 255, SEQ ID NO: 257, SEQ ID NO: 259, 
SEQ ID NO: 261, SEQ ID NO: 263, SEQ ID NO: 265, SEQ ID NO: 267, SEQ ID
NO: 269, SEQ ID NO: 271, SEQ ID NO: 273, SEQ ID NO: 275, SEQ ID NO: 277,
SEQ ID NO: 279, SEQ ID NO: 281, SEQ ID NO: 283, SEQ ID NO: 285, SEQ ID 
NO: 287, SEQ ID NO: 289, SEQ ID NO: 291, SEQ ID NO: 293, SEQ ID NO: 295, 
SEQ ID NO: 297, SEQ ID NO: 299, SEQ ID NO: 301, SEQ ID NO: 303, SEQ ID 
NO: 305, SEQ ID NO: 307, SEQ ID NO: 309, SEQ ID NO: 311, SEQ ID NO: 313,
SEQ ID NO: 315, SEQ ID NO: 317, SEQ ID NO: 319, SEQ ID NO: 321, SEQ ID
NO: 323, SEQ ID NO: 325, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, 
SEQ ID NO: 333, SEQ ID NO: 335, SEQ ID NO: 337, SEQ ID NO: 339, SEQ ID 
NO: 341, SEQ ID NO: 343, SEQ ID NO: 345, SEQ ID NO: 347, SEQ ID NO: 349,

(a) measuring values X1 to Xn representative of at least one activity
selected from the group consisting of binding a desirable odorant to a
ligand-binding domain of at least one of n olfactory receptors, activating at least one
of n olfactory receptors with a desirable odorant, and blocking at least one of n
olfactory receptors with a desirable odorant;

(b) generating a representation of the desirable odorant from the values X1
to Xn;

(c) decomposing the desirable odorant's representation into
representations of known odorants from the database of Claim 14 or
superposing known odorants' representations from the database to reproduce
the desirable odorant; and

(d) formulating the known odorants to reproduce sensory perception of the
desirable odorant and thereby produce the artificial odorant.
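
The decomposition in step (c) can be sketched, under stated assumptions, as a non-negative least-squares fit: the desirable odorant's representation is approximated by a weighted sum of known odorants' representations, with weights constrained to be zero or positive because a formulation cannot contain a negative amount of an ingredient. The claims do not prescribe this or any other numerical method; the function `decompose`, its parameters `steps` and `rate`, and the use of projected gradient descent are illustrative choices only.

```python
from typing import Dict, List, Sequence


def decompose(target: Sequence[float],
              known: Dict[str, Sequence[float]],
              steps: int = 5000, rate: float = 0.01) -> Dict[str, float]:
    """Fit non-negative weights w_k so that sum_k w_k * known[k] approximates target.

    Projected gradient descent on the squared error; `rate` must be small
    relative to the size of the representations for the iteration to converge.
    """
    names: List[str] = list(known)
    for name in names:
        if len(known[name]) != len(target):
            raise ValueError("all representations must have the same length n")
    weights = [0.0] * len(names)
    for _ in range(steps):
        # Current mixture and its deviation from the target representation.
        mix = [sum(w * known[name][i] for w, name in zip(weights, names))
               for i in range(len(target))]
        residual = [m - t for m, t in zip(mix, target)]
        for k, name in enumerate(names):
            grad = 2.0 * sum(r * v for r, v in zip(residual, known[name]))
            # Gradient step followed by projection onto w_k >= 0.
            weights[k] = max(0.0, weights[k] - rate * grad)
    return dict(zip(names, weights))
```

The returned weights would then suggest relative proportions for step (d); how a weight maps to an actual concentration is outside the scope of this sketch.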

17. An artificial odorant produced by Claim 16.



18. A method of identifying a primary odorant related to sensory perception 
comprising: 

(a) providing a representative class of n olfactory receptors or
ligand-binding domains thereof,

(b) measuring at least one activity of a candidate odorant selected from the group
consisting of binding of the candidate odorant to the ligand-binding domain of
at least one of the n olfactory receptors and activating with the candidate
odorant at least one of the n olfactory receptors, and

(c) identifying the candidate odorant as a primary odorant if only one or
less than 10% of the representative class of n olfactory receptors is bound or
activated;

wherein at least one of the n olfactory receptors has an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID
NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID
NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 35, SEQ ID 
NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 43, SEQ ID NO: 45, SEQ ID

9. The method of Claim 1, wherein the value measured for activating an olfactory
receptor is above a preset limit for specific activation.

10. The method of Claim 1, wherein the value measured for blocking an olfactory
receptor is at least a reduction in binding of the odorant or activation by the odorant.

11. The method of Claim 1, wherein the representation of sensory perception is
generated with a neural network.

12. A biosensor comprised of the n ligand-binding domains or olfactory receptors
of Claim 1. 

13. A method for producing a database of odorant representations comprising:

(a) providing one or more known odorants and

(b) generating a representation of the one or more known odorants in
accordance with the method of Claim 1 to produce the database.

14. A database produced by Claim 13. 

15. A method of identifying an unknown odorant comprising:

(a) measuring values X1 to Xn representative of at least one activity
selected from the group consisting of binding the unknown odorant to a
ligand-binding domain of at least one of n olfactory receptors, activating at
least one of n olfactory receptors with the unknown odorant, and blocking at
least one of n olfactory receptors with the unknown odorant;

(b) generating a representation of the unknown odorant from the values X1
to Xn; and

(c) comparing the unknown odorant's representation to the database of
Claim 14 to identify the known odorant which is most similar in
representation.
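
As an illustration of step (c), the comparison against the database of Claim 14 might be sketched as a nearest-neighbour lookup. The sketch assumes that the database is a mapping from odorant name to a vector of n values and that similarity is measured by cosine similarity; neither assumption comes from the claims, which leave the similarity measure open. The names `cosine_similarity` and `identify_odorant` are hypothetical.

```python
import math
from typing import Dict, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two representations of equal length."""
    if len(a) != len(b):
        raise ValueError("representations must have the same length n")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero representation matches nothing
    return dot / (norm_a * norm_b)


def identify_odorant(unknown: Sequence[float],
                     database: Dict[str, Sequence[float]]) -> Tuple[str, float]:
    """Return the known odorant most similar to `unknown` and its similarity."""
    if not database:
        raise ValueError("the database is empty")
    best = max(database, key=lambda name: cosine_similarity(unknown, database[name]))
    return best, cosine_similarity(unknown, database[best])
```

Cosine similarity ignores overall signal magnitude, which may or may not be desirable when odorant concentration matters; a Euclidean distance would be an equally plausible choice.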

16. A method of producing an artificial odorant comprising:

NO: 467, SEQ ID NO: 469, SEQ ID NO: 471, SEQ ID NO: 473, SEQ ID NO: 475,
SEQ ID NO: 477, SEQ ID NO: 479, SEQ ID NO: 481, SEQ ID NO: 483, SEQ ID 
NO: 485, SEQ ID NO: 487, SEQ ID NO: 489, SEQ ID NO: 491, SEQ ID NO: 493, 
SEQ ID NO: 495, SEQ ID NO: 497, SEQ ID NO: 499, SEQ ID NO: 501, SEQ ID 
NO: 503, SEQ ID NO: 505, SEQ ID NO: 507, SEQ ID NO: 509 and SEQ ID NO:
511. 
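
Step (c) of Claim 1, generating a representation from the values X1 to Xn, is not tied to a particular encoding in the claims (Claim 11 names a neural network as one option). A much simpler possibility is sketched below in Python purely for illustration: values that do not exceed a preset limit are set to zero, and the remaining vector is scaled to unit length. The function name `sensory_representation` and the parameter `limit` are assumptions introduced here.

```python
from typing import List, Sequence


def sensory_representation(values: Sequence[float],
                           limit: float = 0.0) -> List[float]:
    """Turn measured values X1..Xn into a unit-length vector.

    Values at or below `limit` (an assumed preset limit for a specific
    response) are zeroed before normalisation.
    """
    kept = [x if x > limit else 0.0 for x in values]
    norm = sum(x * x for x in kept) ** 0.5
    if norm == 0.0:
        # No receptor responded: return the all-zero representation.
        return [0.0] * len(kept)
    return [x / norm for x in kept]
```

A vector of this kind, one entry per receptor, is one plausible form for the entries of the database of Claim 14.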

2. The method of Claim 1, wherein at least one of the olfactory receptors
specifically recognizes the odorant, and there are between 5 and 350 of the n olfactory
receptors selected from the listed amino acid sequences.

3. The method of Claim 1, wherein at least two different activities are measured
to provide the values X1 to Xn.

4. The method of Claim 1, wherein each odorant receptor is expressed in cells,
and the cells expressing each odorant receptor are located at an identifiable position. 

5. The method of Claim 1, wherein at least one olfactory receptor is soluble, and
binding of odorant to a ligand-binding domain of the soluble olfactory receptor is
measured in solution.

6. The method of Claim 1, wherein at least one olfactory receptor is in solid state,
and binding of odorant to a ligand-binding domain of the solid-state olfactory receptor
is measured on a substrate.

7. The method of Claim 1, wherein the value measured for binding is above a 
preset limit for specific binding to olfactory receptors. 

8. The method of Claim 1, wherein the value measured for activating an olfactory
receptor is derived from a signal selected from the group consisting of intracellular
Ca2+, cAMP, cGMP and IP3.

SEQ ID NO: 171, SEQ ID NO: 173, SEQ ID NO: 175, SEQ ID NO: 177, SEQ ID
NO: 179, SEQ ID NO: 181, SEQ ID NO: 183, SEQ ID NO: 185, SEQ ID NO: 187, 
SEQ ID NO: 189, SEQ ID NO: 191, SEQ ID NO: 193, SEQ ID NO: 195, SEQ ID 
NO: 197, SEQ ID NO: 199, SEQ ID NO: 201, SEQ ID NO: 203, SEQ ID NO: 205, 
SEQ ID NO: 207, SEQ ID NO: 209, SEQ ID NO: 211, SEQ ID NO: 213, SEQ ID
NO: 215, SEQ ID NO: 217, SEQ ID NO: 219, SEQ ID NO: 221, SEQ ID NO: 223, 
SEQ ID NO: 225, SEQ ID NO: 227, SEQ ID NO: 229, SEQ ID NO: 231, SEQ ID 
NO: 233, SEQ ID NO: 235, SEQ ID NO: 237, SEQ ID NO: 239, SEQ ID NO: 241, 
SEQ ID NO: 243, SEQ ID NO: 245, SEQ ID NO: 247, SEQ ID NO: 249, SEQ ID
NO: 251, SEQ ID NO: 253, SEQ ID NO: 255, SEQ ID NO: 257, SEQ ID NO: 259,
SEQ ID NO: 261, SEQ ID NO: 263, SEQ ID NO: 265, SEQ ID NO: 267, SEQ ID
NO: 269, SEQ ID NO: 271, SEQ ID NO: 273, SEQ ID NO: 275, SEQ ID NO: 277, 
SEQ ID NO: 279, SEQ ID NO: 281, SEQ ID NO: 283, SEQ ID NO: 285, SEQ ID 
NO: 287, SEQ ID NO: 289, SEQ ID NO: 291, SEQ ID NO: 293, SEQ ID NO: 295,
SEQ ID NO: 297, SEQ ID NO: 299, SEQ ID NO: 301, SEQ ID NO: 303, SEQ ID
NO: 305, SEQ ID NO: 307, SEQ ID NO: 309, SEQ ID NO: 311, SEQ ID NO: 313, 
SEQ ID NO: 315, SEQ ID NO: 317, SEQ ID NO: 319, SEQ ID NO: 321, SEQ ID 
NO: 323, SEQ ID NO: 325, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, 
SEQ ID NO: 333, SEQ ID NO: 335, SEQ ID NO: 337, SEQ ID NO: 339, SEQ ID
NO: 341, SEQ ID NO: 343, SEQ ID NO: 345, SEQ ID NO: 347, SEQ ID NO: 349,
SEQ ID NO: 351, SEQ ID NO: 353, SEQ ID NO: 355, SEQ ID NO: 357, SEQ ID 
NO: 359, SEQ ID NO: 361, SEQ ID NO: 363, SEQ ID NO: 365, SEQ ID NO: 367,
SEQ ID NO: 369, SEQ ID NO: 371, SEQ ID NO: 373, SEQ ID NO: 375, SEQ ID 
NO: 377, SEQ ID NO: 379, SEQ ID NO: 381, SEQ ID NO: 383, SEQ ID NO: 385,
SEQ ID NO: 387, SEQ ID NO: 389, SEQ ID NO: 391, SEQ ID NO: 393, SEQ ID
NO: 395, SEQ ID NO: 397, SEQ ID NO: 399, SEQ ID NO: 401, SEQ ID NO: 403, 
SEQ ID NO: 405, SEQ ID NO: 407, SEQ ID NO: 409, SEQ ID NO: 411, SEQ ID
NO: 413, SEQ ID NO: 415, SEQ ID NO: 417, SEQ ID NO: 419, SEQ ID NO: 421, 
SEQ ID NO: 423, SEQ ID NO: 425, SEQ ID NO: 427, SEQ ID NO: 429, SEQ ID
NO: 431, SEQ ID NO: 433, SEQ ID NO: 435, SEQ ID NO: 437, SEQ ID NO: 439,
SEQ ID NO: 441, SEQ ID NO: 443, SEQ ID NO: 445, SEQ ID NO: 447, SEQ ID 
NO: 449, SEQ ID NO: 451, SEQ ID NO: 453, SEQ ID NO: 455, SEQ ID NO: 457, 
SEQ ID NO: 459, SEQ ID NO: 461, SEQ ID NO: 463, SEQ ID NO: 465, SEQ ID

Claims:

1. A method for representing sensory perception of one or more odorants
comprising:

(a) providing a representative class of n olfactory receptors or
ligand-binding domains thereof;

(b) measuring values X1 to Xn representative of at least one activity of the
one or more odorants selected from the group consisting of binding of the one
or more odorants to the ligand-binding domain of at least one of the n
olfactory receptors, activating at least one of the n olfactory receptors with the
one or more odorants, and blocking at least one of the n olfactory receptors
with the one or more odorants; and

(c) generating a representation of sensory perception from the values X1 to
Xn;

wherein at least one of the n olfactory receptors has an amino acid sequence selected
from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID 
NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID
NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 35, SEQ ID
NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 43, SEQ ID NO: 45, SEQ ID
NO: 47, SEQ ID NO: 49, SEQ ID NO: 51, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID 
NO: 57, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 63, SEQ ID NO: 65, SEQ ID 
NO: 67, SEQ ID NO: 69, SEQ ID NO: 71, SEQ ID NO: 73, SEQ ID NO: 75, SEQ ID 
NO: 77, SEQ ID NO: 79, SEQ ID NO: 81, SEQ ID NO: 83, SEQ ID NO: 85, SEQ ID
NO: 87, SEQ ID NO: 89, SEQ ID NO: 91, SEQ ID NO: 93, SEQ ID NO: 95, SEQ ID
NO: 97, SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, SEQ 
ID NO: 107, SEQ ID NO: 109, SEQ ID NO: 111, SEQ ID NO: 113, SEQ ID NO: 115,
SEQ ID NO: 117, SEQ ID NO: 119, SEQ ID NO: 121, SEQ ID NO: 123, SEQ ID
NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131, SEQ ID NO: 133,
SEQ ID NO: 135, SEQ ID NO: 137, SEQ ID NO: 139, SEQ ID NO: 141, SEQ ID
NO: 143, SEQ ID NO: 145, SEQ ID NO: 147, SEQ ID NO: 149, SEQ ID NO: 151, 
SEQ ID NO: 153, SEQ ID NO: 155, SEQ ID NO: 157, SEQ ID NO: 159, SEQ ID 
NO: 161, SEQ ID NO: 163, SEQ ID NO: 165, SEQ ID NO: 167, SEQ ID NO: 169,

TGTTCTTCATCCATTTCTTCACTGTCATGGAGAGCATCGTATTGGTGGCCATGGCCTTTGAC

CGCTACATTGCCATTTGCAAACCT(nTTGGTACACCATGATCCTCACCAGCAAAATCATCA 

GCCTCATTGCAGGCATTGCTGTCCTGAGG 

CCTCTTAAGGTTGCCCTTCTGTGGACATCGTATCATCCCTCATACTTACTGTGA 
GCATTGCCCGTCTGGCCTGTGCCAGCATCAAAGTCAACATTATGTTTGGTCTTGGCAGTAT
TTCTCTCITGTTATTGGATGTGCTCCTTATrATTCTCTCCCATATCAGGATCCT 
CTTCTGCCTGCCCTCCTGGGAAGCTCGACTCAAAGCTCTCAACACCTGTGGCTCTCACATT 
GGTGTTATCTTAGCCTTTTCTACA 

GATATTCCCCAATATATCCACATTTTCTTGGCTAATCTATATGTGGTTGTTCCTCCCACCCT 
CAATCCTGTAATCTATGGGGTCAGAACCAAACATATTAGGGAGACAGTGCTGAGGATTTTC
TTCAAGACAGATCACTAA (SEQ ID NO: 510) 



AOLFR328 sequences: 
MALGNHSTITEFLLLGLSADPNII^^ 
LSFVDLCFSSVIWKMLENLLSQRK

LLYGQIMGKQLYMHLVWGSWGLGFLDALI^ 
SRSLIVLLCSTLLHGLGOTLLWLSYTO^ 

MPNSGSPIELIFSVQYTWTPMLNSLIYSLKNKEVKVALKRTLEKYLQYT^ (SEQ ID NO: 511)


20 ATGGCCTTGGGGAATCACAGCACCATCACCGAGTTCCTCCTCCTTGGGCTGTCTGCCGACC 
CCAACATCCGGGCTCTGCTCTTTGTGCTGTTCCTGGGGATTTACCTCCTGACCATAATGGA 
AAACCTGATGCTGCTGCTCGTGATCAGGGCTGATTCTTGTCTCCATAAGCCCATGTATTTCT 
TCCTGAGTCACCTCTCrTTTGTTGATCTCTGCTTCTCTTCAG 
GAGAACCTCCTGTCACAGAGGAAAACCATTTCAGT^ 

25 TTGTGTTTGTCACTGCAGGGACTGAAGCCTGCCTTCTCTCAGGGATGGCCTATGACCGCCA 
TGCTGCCATCCGCCGCCCACTACTTTATGGACAGATCATGGGTAAACAGCTGTATATGCAC 
CTTGTGTGGGGCTCATGGGGACTGGGCTTTCTGGACGCACTCATCAATGTCCTCCTAGCTG 
TAAACATGGTCTTTTGTGAAGCCAAAATCATTCACCACTACAGCTATGAGATGCCATCCCT 
CCTCCCTCTGTCCTGCTCTGATATCTCCAGAAGCCrrCATCGTTTTGCT 

30 TACATGGGCTGGGAAACTTCCITTTGGTCT 

CTAAGCATCAGCTCTACCTCGGGCAGAAGCAAGGCCTTCTCCACCTGCTCT 
CTGCAGTGACACTTTACTATGGCTCAGGTTTGCTCCGCCATCTCATGCCAAACT 
CCCCATAGAGTTGATCTTCTCTGTGCAGTATACTGTAGTCACTCCCATGCTGAATTCCCTCA 
TCTATAGCCTGAAAAATAAGGAAGTGAAGGTAGCTCT 

TGCAATATACCAGACGTTGA (SEQ ID NO: 512)
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ATCTGGAATGATAGTTTTAATCAGAA^ 

CCTCAGTCATCTGTCITTCTGTGATCrCTGCTATTCTACTGCAACTGGGCC 

TAGATCTACTTGCCAAGAACAAGTCAATACCCTTCTATGGCrGTGCTCTG^AA^^^"G^^ 

C^CTGTATCTTTGCAGATTCTGAGTGTCTACTGCTGTCAGTGATGGCC^^^^T^^^^^A 

CTTGACTGGGGTTTATCTGGTGGGAATAGCAGATGCTTTGATACATATGACACTGGCCTTC 
CGCCTATGCirCTGTGGGTCTAATGAGATTAATCATrrCTTCTGTGAT^ 

TTCAACTGAGTACCATTTCAGGAGTTTTCATTTC™^ 

GAGATACACTCTGCTGAGGGGAGGTTCAAAGCTCTCTCTACATGCACTTCCCACTTATCT 

CGGTrGCAATTTTCCAGGGAACTCTGCTCTTTATGTATITCCGGCCAA^ 

C ^ A ^ TCAAGATAAAATGA C^ 




ATGGAGGTGAAGAACTGCTGCATGGTGACAGAGTTCATCCTTTTGGGAATCCCACArArA 
GAGGGGCTGGAGATGACACTTTTTGTCTTATTCTTGCCCTTCTA 

TCCTGGG^CITGTCTGTGTTTGACATGro 
G^ACCTTATGGGGCTGAGCCGACTCATCTCCTACA^^ 

iSI^^G^ATCCTCTGCGATACACAGTCATCATGAACCCAAGGATCTGTG^^^ 

TGGCTGTGGGCACATGGCTGTTAGGGTGCATTCATTCCAGTATCTTGACCTCCCTCACCTrc 
ACCTTGCCATACTGTGGTCCC^ 

?S CC " G ^ CTGTGCTGACACATCOTAGCCCAGAG ^^ 

AAGCATTCGTACAACTGAGGGCCGTCGCCGTGCCTTCTCCACCTGCAGTGCTCACCTCA^ 
^TCCTCTGTGCCTATGGGCCCATCATCACTC^^ 

TCCTGGGAACCGTGGTACAAATTCTCATGAATCTGGTAGGA^ 
CTATACCTTGAGGAATAAGGAAGTAAAAACA^ 

CCATGTTCCTGAGAGTTAG (SEQ ID NO: 508)
AOLFR324B sequences: 

DIPQYlHIFLANLYVVVPPTLNP^YGWTKHIRETXaRIFFKTO (SEQ ID NO: 509)
i^ CC 7 ATAGCTAACGACACCCAG ™ CATAOTOTC ATTCCTACT^ 

?^? AA 2™™ ACATCTGGA ™ GA ^ cccimmCT 

™ GGA ^™ CTGCTATOTCm ^ 

ATGCTGGGCATCITCTGGTTCAATATCAAGGAAATATCTTTTGGAGGCTACCrTTCTCAGA 
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TCCCTCTCTTATGGCAGCTGCATGTTTATGTACATTAATCCTTCTGCAAAAGAAGGAGGTG 

(mCAACAAAGGAATAGCTGTACTCATTACTTCGGTTACTCCCTTACTGAATCCCTTCATA 

TATACTTTAAGAAATCAGCAAGTGAAACAAGCTTTCAAGGACTCAGTCAAAAAGATTGTG 

AAACTTTAA (SEQ ID NO: 500) 


AOLFR274B sequences: 

MEFVFLAYPSCPELHILSFLGVSLVYGLnTGNILIWSIHTETCLCTSMYYFLGSLSGIEICYTAV 
VVPHILANTLQSEKTITLLGCATQMAFF1ALGSADCFLLAAMAYDRYVAICHPLQYPLLMTLTL 
CVHLWASVISGLFLSLQLVAFIFSLPFCQAQGIEHFFCDVPPVMHWCAQSHIHEQSVLVAAIL 
10 AIAVPFFLITTSYTFIVAALLKJHSAAGRHRAFSTCSSHLTVVLLQYGCCAFMYLCPSSSYNPKQ 
DRFISLVYTLGTPLLNPLIYALRNSEMKGAVGRVLTRNCLSQNS (SEQ ID NO: 501) 

ATGGAATTTGTGTTCCTGGCCTATCCCTGCTGCCCAGAACTGCATATTCTGTCCTTCCTTGG 
GGTCAGCCTGGTTTATGGTTTGATCATCACTGGGAACATTCTCATTGTGGTGTCCATTCAC 

1 5 ACAGAAACCrGTCTATGCACATCCATGTACTATTTCCTGGGCAGCCTTTCTGGGATTGAAA 
TATGCTACACTGCAGTGGTGGTGCCCCATATCCTGGCCAACACCCTACAGTCAGAGAAGAC 
CATCACTCTCCTGGGCTGTGCCACCCAGATGGCTrTCTTCATTGCACTGGGCAGTGCTGAT 
TGC^CCTCTTGGCTGCCATGGCCTATGACCGCTATGTGGCCATTTGCCACCCGTTGCAGTA 
CCCTCTCCTCATGACATTGACTCTTTGTGTCCACTTGGTTGTGGCATCAGTCATCAGTGGTC 

20 TGTTCCTGTCCTTACAACTGGTGGCCTTCATCTTCTCTCTGCCATTCTGCCAGGCTCAGGGC 
ATTGAGCACTTCTTTTGTGATGTGCCACCAGTCATGCATGTTGTTTGTGCTCAGAGTCACAT 
TCATGAGCAGTCAGTGCrGGTGGCAGCCATACTAGCCATTGCTGTGCCTTrCTTCCTCATC 
ACCACCTCCTACACCTTCATAGTGGCTGCTCTGCTCAAGATCCACT^^ 

ACCGGGCC^CTCCACCTGCTCTTCCCACCTCACTG^ 
25 GCCTTCATGTACCTGTGCCCCAGCTCCAGCTACAACCCCAAGCAAGATCGGTTCATCTCAC 
TGGTGTACACATTGGGAACCCCACTGCTCAACCCACTTATCTATGCCCTGAGGAACAGTGA 
GATGAAAGGGGCCGTAGGGAGAGTTCTTACCAGGAACTGCCTTTCCCAGAACAGCTAG 

(SEQ ID NO: 502) 

AOLFR276B sequences:

MGGFGTMSSTTSFTT-TGFPEMKGLEHWLAALLLLLYAISFLGNILILFIIKEEQSLHQPMYYFLS 
LFS\T^LGVSFSTLPTVLAAVCFHAPETTFDACLAQMFFIHFSSWTEFGILLAMSFDHYVAICNP 

LRYATVLTDWVAHNGISIVIRSFCMV^^ 

TmSMYGLFIVISAFGVDSLLILLSYVLILHSVLAIASRGERLKTLNTCVSHIYAVLIFYVPMVSVS 
3 5 MVHRFGRHAPEYVHKFMSLCTSNALPNYLFHQD (SEQ ID NO: 503) 

ATGGGGGGCTTTGGGACTAACATCTCAAGTACTACCAGCTTCACTCrAACAGGCTTCCCTrc 
AGATGAAGGGTCTGGAGCACTGGCTGGCTGCCCTTCTGCTGCT 

CCTGGGCAACATCCTCATCCTCTTTATCATAAAGGAAGAGCAGAGCTTGCACCAGCCAATG 

40 TACTAC^CTG^ 

GTACTGGCTGCTGTGTGTTrTCATGCCCCAGAGACAAGTTTTGATGCCTGGGTGGCCCAGA 

TGTTCTTCATCCACTTTTCCrCCTGGACAGAGTTTGGCATCCrACTGGCCATGAGTTTTGAC 

CACTATGTGGCCATCTGTAACCCGCTGCGCTATGCCACAGTGCTCACTGATGTCCOTGT^ 

CCCACAATGGGATATCCATTGTCATCCGCAGCTTCTGCATGGTATTCCCACTTC 

45 CTGAAGAGACrGCCTTTCrGTAAGGCCAGTGTGGTACTGGCCCATTCCTACTGTCTGCATG 

cIgactgaScggc^^ 

TGTCATCTCTGCCTTTGGTGTAGATTCACTGCTC 

ATTCTGTGCTGGCCATTGCCTCCAGGGGTGAGAGGCTTAAGACACTCAACACATGTGTGTC 
ACATATCTATGCAGTGCTGATCTTCTATGTGCCTATGGTTAGTGTGTCCATGGTTCATCGAT 
50 ^GGGAGGCATGCTCCTGAATATGTGCACAAGTTCATGTCTCTTTGTACCTCCAATGCTCT 
ACCCAATTATCTATTCCATCAAGACTAA (SEQ ID NO: 504) 

AOLFR311B sequences: 

MDWENCSSLTDFFLLG1T>JOTEMKVTLFAW 

55 ScdSystatgpkmlvdllaknksipfygcalqflw 
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GCCTATGACCGATATGTGGCCATTTGTAACCCTCTATTGTATATGGTTGTAATGACTCCAG 

GAATCTGCATTCAACTTGTAGCAGTTCCTTATAGCTATAGCTTCCTAATGGCACTArrrCAC 

ACCATCCTCACCTTCCGCCTCTCCTATTGCCACTCCAACATTGTCAACCATTTCTATTGTGA 

TGACATGCCTCTCCTCAGGCTAACITGCTCAGACACTCGCTTCAAACAGCTCTGGATCrrr 

5 GCCTGTGCTGGTATCATGTTCATTrCCTCCCTTCTGATTGTCTTTGTCTCCTACATGTTCATC 

ATTTCTGCCATCCTGAGGATGCATTCAGCTGAGGGAAGACAGAAGGCTTTCTCGACGTGTG 

GCTCTCACATGCTGGCAGTCACCATATTCTATGGGACCCTCATTTTTATGTACTTACAGCCT 

AGCTCTAGCCATGCCCTGGACACAGACAAGATGGCCTCTGTCTTCTACACAGTGATCATTC 

CCATGTTGAATCCCTTAATCTATAGCCTCCAGAATAAGGAGGTGAAAGAAGCTCTGAAGA 
AAATCATTATCAATAAAAACTAG (SEQ ID NO: 496)



AOLFR171C sequences: 

MAEVNIIYVTWILKGITNRPELQAPCFGWLVIYLVTVLGNLGLITLIKIDTRLHTPMYYF^ 
AFVDLCYSSAITPKMMVNFVVERNTIPFHACATQLGCFLT7MITECFLLASMAYDCYVAICSPL 
1 5 HYSTLMSRRVCIQLVAVP YIYSFLVALFHTVITFRLTYCGPNLINHFYCDDLPF1ALSCSDTHMK 
EILIFAFAGFDMISSSSIVLTSYIFIIAAILRIRSTQGQHKAISTCGSHMVTVTIFYGTLIFMYL 
NHSLDTDKMASVFYTVVIPMLNPLIYSLRNKE^^ 



20 ATGGCTGAAGTTAATATCATTTATGTCACTGTATTCATTCTGAAAGGAATTACCAACCGGC 
CAGAGCTTCAGGCCCCGTGCTTTGGGGTGTTTTTAGTTATCTATCTGGTCACAGTGCTGGG 
CAATCTTGGGTTGATTACTTTAATCAAGATTGATACTCGACTCCACACACCTATGTACTATT 
TCCTCAGCCACCTGGCCITTGTTGACCTTTGTTACTCCTCTGCTATTACACCGAAGATGATG 

^ G 3^^ -imG ™TGGAACGCAACACCATTCCTTTCCATGCTTGTGCAACCCAACTGGG'TT 

25 GTTTTCTCACCITCATGATCACTGAGTGTTTCCTTCTAGCCTCCATGGCCTACGATTGCTAT 
GTCGCCATCTGTAGTCCCCTGCATTATTCAACACTGATGTCAAGAAGAGTCTGCATTCAAC 
TGGTGGCAGTTCCATATATATACAGCTTCCTGGTTGCCCTCTTCCACACCGTTATCACTTTC 
CGTCTGACTTACTGTGGCCCAAACTTAATTAACCATTTCTATTGTGATGACCTCCCCTTCTT 
AGCTCTGTCCTGCTCAGACACACACATGAAGGAAATTCTGATATTTGCCTTTGCTGGCTTT 

30 GATATGATCTCTrCCTCTTCCATTGTCCTCACCTCCTACATCTTTATTATTGCCGCTATCCTA 
AGGATCCGCTCTACTCAGGGGCAACACAAAGCCATTTCCACCTGTGGCTCCCATATGGTGA 

CTGTCACTATTTTCTATGGCACACTGATCTTTATGTACCTACAGCCCAAATCAAATCACTCC

TTGGACACAGACAAGATGGCTTCTGTATTTTACACAGTGGTGATCCCCATGTTAAACCCCC 
TAATCTATAGTCTAAGGAACAAAGAAGTGAAAGATGCCTCAAAGAAAGCCTTGGATAAAG 
35 GTTGTGAAAACTTACAGATATTAACATTTTTAAAAATAAGAAAACTTTATTAA (SEQ ID NO- 



AOLFR225B sequences: 

MKNRTMFGEFnXGLTNQPELQVMIITFLFLTYML^ 
40 LEISFTSIFIPPJLTSMTTGNKVISFAGCLTQYFFAJFLGATEFYLIASMSYDRYVAICKPLHYLTI 

MSSRVaQLWCSWLGGFLAJLPPIIIMTQVDFCVSNILNHYYCDYGPLVEIACSDTSlXEUvm 
LLAVVTLMVTLVLVTI^YTY7IRT1LPJU°SAQQR 

GGAFNKGIAVLITSVTPLLNPFIYTLRNQQVKQAFKDSVKKIVKL (SEQ ID NO: 499) 

45 ATGAAAAACAGAACCATGTTTGGTGAGTTTATTCTACTGGGCCTTACAAATCAACCTGAAC 
TCCAAGTGATGATATTCATCTTTCTGTTCCTCACCTACATGCTAAGTATCCTAGGAAATCTG 
ACTATTATCACCCTCACCTTACTAGACCCCCACCTCCAGACCCCCATGTATTTCTTCCTCCG 
GAATTTCTCCTTCTTAGAAATTTCCTTCACATCCATTTTTATTCCCAGATTTCTGACCAGCA 
TGACAACAGGAAATAAAGTTATCAGCTTTGCTGGCTGCTTGACTCAGTATTTTTTTGCTAT 

50 Ai 1 1 CTl GGAGCTACCGAGTTTTACCTCCTGGCCTCCATGTCTTATGATCGTTATGTGGCCA 
TCTGCAAACCCTTGCATTACCTGACTATTATGAGCAGCAGAGTCTGCATACAACTAGTGTT 
CTGCTCCTGGTTGGGGGGATTCCTAGCAATCTTACCACCAATCATCCTGATGACCCAGGTA 
GATTTCTGTGTCTCCAACATTCTGAATCACTATTACTGTGACTATGGGCCTCTCGTGGAGCT 
TGCCTGCTCAGACACAAGCCTtrrTAGAACTGATGGTCATCCTCTTGGCCGTTGTGACTCTC 

55 ATGGTTACTCTGGTGCTGGTGACACTTTCTTACACATACATTATCAGGACTATTCTGAGGA 
TCCCTTCTGCCCAGCAAAGGACAAAGGCCTTTTCCACTTGTTCCTCCCACATGA-rTGTCATC 
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ATGAAAAATAAAACCGTGTTAACTGAGTTTATCCTTCTGGGTCTAACAGATGTCCCTGAAC 

TCCAGGTGGCAGTTTTCACCTTTCTTTTCCTTGCGTATTTACTCAGCATCCTTGGAAATCTG

ACTATCCTCATCCTCACCTTGCTGGACTCCCACCTTCAGACTCCCATGTATTTCTTTCTCCG 
5 GAACTTCrCCTTCTTGGAAATTTCCTTCACAAACATCTTCATTCCAAGGGTCCTGATTAGCA 
TCACAACAGGGAACAAGAGTATCAGCTTTGCTGGCTGCTTCACTCAGTATTTCTTTGCCAT 
GTTCCTTGGGGCTACAGAGTTTTACCTTCTGGCTGCCATGTCCTATGACCGCTATGTGGCC 
ATCTGCAAACCTCTGCATTACACCACCATCATGAGCAGCAGAATCTGCATCCAGCTGATTT 
TCTGCTCTTGGCTGGGTGGGCTAATGGCTATTATACCAACAATCACCCTGATGAGTCAGCA 

1 0 GGACTTTTGTGCATCCAACAGACTGAATCATTACTTCTGTGACTATGAGCCTCTTCTGGAA 
CTCTCATGTTCAGACACAAGCCTCATAGAGAAGGTTGTCTTTCTTGTGGCATCTGTGACCC 
TGGTGGTCACTCTGGTGCTAGTGATTCTCTCCTATGCATTCATTATCAAGACTATTCTGAAG 
CTCCCCTCrGCCCAACAAAGGACAAAAGCCTTTTCCACATGTTCTTCCCACATGATTGTCAT 
CTCCCTCTCTTACGGAAGCTGCATGTTTATGTACATTAATCCCTCTGCAAAAGAAGGGGAT 

1 5 ACATTCAACAAGGGAGTAGCTCTACTCATTACTTCAGTTGCTCCTTTGTTGAACCCCTTTAT 
TTACACCCTAAGGAACCAACAGGTAAAACAACCCTTCAAGGATATGGTCAAAAAGCTTCT 

GAATCTTTAA (SEQ ID NO: 492) 



AOLFR130B sequences:
20 MEGKNQTAPSEFnLGFDHLNELQYLLFTIFFLTYICTLGGNVFIIVVTlADSHLHTPMYYFLGNL 
ALID1CYTTTNWQMMVHLLSEKKIISYGGCVTQLFAFIFFVGSECLLLAAMAYDRYIAICKPLR 
YSFIMNKALCSW1AASCWTCGFLNSVLHWLTFHLPFCGNNQINYFFCDIPPLL1LSCGDTSLNE 
LALLSIGILISWTPFLCIII^YLYIISTILPJRSSEGPJIKAFSTCASHLLIVILYYGSAIFrYVPJ>K 
LEKDPXISVLYSWTPMLNPVTYTLRNKDIKEAVKAIGRKWQPPVFSSDI (SEQ ID NO: 493) 


ATGGAAGGAAAGAATCAAACAGCTCCATCTGAATTCATCATCTTGGGGTTCGACCACCTGA 
ATGAATTGCAGTATTTACTCTTCACCATCTTCTTTCTGACCTACATATGCACTTTAGGAGGC 
AATGTTTTTATCATTGTGGTGACCATAGCTGATTCCCACCTACACACACCCATGTATTATTT 
CCTAGGAAATCTTGCCCTTATTGACATCTGCTACACTACTACTAATGTCCCCCAGATGATG 

30 GTGCATCTTCrGTCAGAGAAGAAAATCATTTCCTATGGAGGCTGTGTGACCCAGCTCTTTG 
CATTCATTTTCTTTGTTGGCTCAGAGTGTCTCCTCCTGGCAGCAATGGCATATGATCGATAT 
ATTGCTATCTGTAAGCCGTTAAGGTACTCATTTATTATGAACAAGGCCCTGTGCAGCTGGT 
TAGCAGCCTCATGCTGGACATGTGGGTTTCTCAACTCAGTGTTGCACACCGTTCTGACCTT 
CCACCTGCCCTrCTGTGGTAACAATCAGATCAATTATTTCTTCTGTGACATACCTCCCTTGC 

35 TCATCTTGTCTTGTGGTGATACTTCCCTCAATGAACTGGCITTGCrGTCCATTGGGATCCTC 
ATAAGCTGGACTCCTTTCCTGTGCATCATCCTTTCCTACCTTTACATCATCTCCACCATCCT 
GAGGATCCGTTCCTCTGAGGGGAGGCACAAAGCCTTTTCCACCTGTGCCTGCCACCTGCTC 
ATTGTTATTCTCTATTATGGCAGTGCTATCTTCACGTATGTGAGGCCCATCTCATCTTACTC 
TCTAGAGAAAGATAGATTGATCTCAGTGCTGTATAGTGTTGTCACACCCATGCTGAATCCT 

40 GTAATTTATACGCTAAGGAATAAGGACATCAAAGAGGCTGTGAAGGCCATAGGGAGAAAG 
TGGCAGCCACCAGTTTTCTCTTCTGATATATAA (SEQ ID NO: 494) 



AOLFR142B sequences: 

MAP^MAHmCTQATEFILVGLTDHQELKMPLFVLFLSr/LFTWGNLGLILLniADTSLmTM 
45 WFLSNLAFVDFCYSSVITPKMLGNFLYKQNVISFDACATQLGCFLTFMISESLLLASMAYDRY 
VAICM>LLYMVVMTPGICIQLVAVPYSYSFLMALFHTILTFRI^YCHSNIVNHFYCDDMPLLRL 
TCSDTRFKQLWIFACAGIMFISSLLIWVSYMFnSAILRMHSAEGRQKAFSTCGSHMLAVTIFYG 
TLn 7 MYLQPSSSHALDTDKMASWYTVIIPMLNPLr^SLQNKEVKEALKKIIINKN (SEQ ID NO: 495)


ATGGCCAGAAAAGATATGGCTCACATCAATTGCACCCAGGCGACAGAGTTTATTCTTGTGG 
GCCTCACAGACCATCAGGAGTTGAAGATGCCCCTCrTTGTGCTATTCTTATCCATCTACCTC 
TTCACAGTGGTAGGCAACTTGGGTTTGATCCTACTCATTAGAGCGGATACAAGTCTCAACA 
CACCAATGTACITCTTTCTTAGCAACCTAGCTTTTGTGGATTTCTGTTACTCrrCTGTCA^ 
55 ACACCCAAAATGCTTGGGAATTTCTTGTACAAACAAAATGTTATATCCTTTGATGCATGTG 
CTACTCAACTGGGCTGCTTTCTCACCTTCATGATATCAGAATCCTTGCTACTGGCTTCCATG 
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AOLFR24B sequences: 
MPSINDTHFYPPFFLLLGIPGLDTLHIW]^ 

I^IvnDLGLSTSTIPKMLGIFlVFNLQEISFGGClXQMFFIHMFTGMETVLLVVMAYD 
LQYTMILTNKTISILASVVA'GRNLVLVTPFVFLI^^ 

HYGLMVISYIIVDVILMSSWLILRAWRLPSQDVPXKAFNTCGSHVC^ 
RFGQNIPHYIHILLANLYVWPPALNPVIYGVRTKQIREQIVKIFVQKE (SEQ ID NO: 487)






ATGCCTTCTATCAATGACACCCACTTCTATCCCCCCTTCTTCCTCCTGCTAGGAATACCAGG 

ACTGGACACTTTACATATCTGGATTTCTTTCCCATTCTGTATTGTGTACCTGATTGCCATTG 

1 0 TGGGGAATATGACCATTCTCTTTGTGATCAAAACTGAACATAGTCTACACCAGCCCATGTT 

CTACTTCCTGGCCATGTTGTCTATGATTGATCTGGGTCTGTCCACATCCACTATCCCCAAAA 

TGCTAGGAATCTTCTGGTTCAACCTCCAAGAGATCAGCITTGGGGGATGCCTTCTTCAGAT 

GTTCTTTATTCACATGTTTACAGGCATGGAGACTGTTCTGTTGGTGGTCATGGCTTATGACC 

GCTTTGTTGCCATCTGCAACCCTCTCCAGTACACCATGATCCTCACCAATAAAACCATCAG 

1 5 TATCCTAGCTTCTGTGGTTGTTGGAAGAAATTTAGTTCTTGTAACCCCATTTGTGTTTCTCA 

TTCTGCGTCTGCCATTCTGTGGGCATAACATCGTACCTCACACATACTGTGAGCACAGGGG 

TCTGGCCGGGTTGGCCTGTGCACCCATTAAGATCAACATAATCTATGGGCTCATGGTGATT 

TCITATATTATTGTGGATGTGATCTTAATTGCCrCTTCCTATGTGCrTATCCTTAGAGCTGT 

TTTTCGCCTTCCCTCrCAAGATGTCCGACTAAAGGCCTTCAATACCTGTGG'TTCTCATGTCT 

GTGTTATGCTGTGCTTTTACACACCAGCATTTTTTTCTrTTATGACACATCGTTTTGGCCAA 

AACATTCCCCACTATATCCATATTCTTTTGGCTAACCTGTATGTGGTTGTCCCACCTGCCCT 

TAACCCTGTCATTTATGGAGTCAGGACCAAGCAGATCCGAGAGCAAATTGTGAAAATATTT 
GTACAGAAAGAATAA (SEQ ID NO: 488) 

AOLFR33B sequences:

MLHTNNTQFHPSTFLVVGWGLEDVHVWIGFPFFAWLTALLGNIIIIJVIQTEQSLHQ 
AMI^GTDLGL^TATJPKMLGIFWFNLGEIAFGAClTQMYTra 

LRYSMILTT^VIAILGIVirVRTLVFVTPFTFLTLPXPFCGVRIIPHTYCEHMGLAKLACASINVIY 
GLIAFSVGYIDISVIGFSYVQILRAVFTlLPAWDAEiKAI^TCGSHVCVMLAFYLPALFSFMTHRF 

ATGCTTCATACCAACAATACACAGTTTCACCCTTCCACCTTCCTCGTAGTGGGGGTCCCAG 
GGCTGGAAGATGTGCATGTATGGATTGGCTTCCCCTTCTTTGCGGTGTATCTAACAGCCCT 

35 TCT^GGGAACATCATTATCCTGTTTGTGATACAGACTGAACAGAGCCTCCACCAACCCATG 
TTTTACTTCCTAGCCATGTTGGCCGGCACTGATCTGGGCTTGTCTACAGCAACCATCCCCA 
AGATGCTGGGAATTTTCTGGTTTAATCTTGGAGAGATTGCATTTGGTGCCTGCATCACACA 
GATGTATACCATTCATATATGCACTGGCCTGGAGTCTGTGGTACTGACAGTCACGGGCATA 
GATCGCTATATTGCCATCTGCAACCCCCTGAGATATAGCATGATCCTTACCAACAAGGTAA 

40 TAGCCATTCTGGGCATAGTCATCATTGTCAGGACTTTGGTATTTGTGACTCCATTCACATTT 
CTCACCCTGAGATTGCCTTTCTGTGGTGTCCGGATTATCCCTCATACCTATTGTGAACACAT 
GGGCTTGGCAAAGTTAGCTTGTGCCAGTATTAATGTTATATATGGATTGATTGCCTTCTCA 
GTGGGATACATTGACATTTCTGTGATTGGATTTTCCTATGTCCAGATCCTCCGAGCTGTCTT 
CCATCTCCCAGCCTGGGATGCCCGGCTTAAGGCACTCAGCACATGTGGCTCTCACGTCTGT 

45 GTTATGTTGGCTTTCTACCTGCCAGCCCTCrTTTCCTTCATGACACACCGCTTTGGCCACAA 

CATCCCTCATTACATCCACATTCTTCTGGCCAATCTGTATGTGGTTTTTCCCCCTGCTCTTA 

ACTCTGTTATCTATGGGGTCAAAACAAAACAGATACGAGAGCAGGTACTTAGGATACTCA 

ACCCTAAAAGCTTTTGGCATTTTGACCCCAAGAGGATCTTCCACAACAATTCAGTTAGACA 
ATAA (SEQ ID NO: 490) 


AOLFR112B sequences: 

MKhflCTVLTEFILLGLTDVPELQVAVFTFLFLAY^ 
LEISFTMFTPRVIJSITTGNKSISFAGCITQYFFAMFLGATEFYL^ 

MSSRJaQLIFCSWLGGLMAHPTITLMSQQDFCASNRLNHYFCDYEPLLELSCSDTSLIEKVVFL 
55 VASVTLVVTLVLVILSYAFlIKTILKLPSAQQRTKAFSTCSSHMrVISI^YGSCMFMYIOT 
DTFNKGVALLITSVAPLLNPRYTLRNQQVKQPFKDMVKKLLNL (SEQ ID NO: 491) 




GAGGATCCAGTAATGGGGAGGGCCGCAGACCGACTTACTCTTCCTGCTCCTTCCCACCTTT 
GCATGGTGGGACTCCTTTTTGGCAGCACCATGGTCATGTACATGGCCCCCAAGTCCCGCCA 
CCCTGAGGAGCAGCAGAAGGTCCTTTCCCTGTTTTACAGCCTTTTCAACCCGATGCTGAAC 
CCCTTGATCTACAGCCTGAGGAACGCAGAGGTCAAGGGTGCCCTGAAAAGAGTGTTGTGG 

AAACAGAGATCAAAGTGA (SEQ ID NO: 482).
AOLFR257 sequences: 

MESNQTWITEV1LLGFQVDPALELFLFGFFLLFYSLTLMGNGIILGLIYLDSRLHTPMYVFLSHL 
AIVDMSYASSTVPKMLANLVMHKKVISFAPCILQTFLYLAFAITECLILVMMCYDRYVAICHPL 
OYTLIMNWRVCriVLASTCWIFSFLLALVHITLILPXPFCGPQKINHFFCQIMSVFKLAC

OWLFAGSAFILVGPLCLVLVSYLHILVAILPJQSGEGRPJCAFSTCSSHLCVVGLFFGSAIVMYM 
APKSSHSQERRKILSLFYSLFNPILNPLIYSLRNAEVKGALKRVLWKQRSM (SEQ ID NO: 483). 

ATGGAAAGCAATCAGACCTGGATCACAGAAGTCATCCTGTTGGGATTCCAGGTGGACCCA 
GCTCTGGAGTTGTTCCTCTTTGGGTTTTTCTTGCTATTCTACAGCTTAACCCTGATGGGAAA
TGGGATTATCCTGGGGCTCATCTACTTGGACTCTAGACTGCACACAGCCATGTATGTCTTC 
CTGTCACACCTGGCCATTGTGGACATGTCCTATGCCTCGAGTACTGTCCCTAAGATGCTAG 
CAAATCTTGTGATGCACAAAAAAGTCATCTCCTTTGCTCCTTGCATACTTCAGACTTTTTTG 
TATTTGGCGTTTGCTATTACAGAGTGTCTGATTTTGGTGATGATGTGCTATGATCGGTATG 
TGGCAATCTGTCACCCCTTGCAATACACCCTCATTATGAACTGGAGAGTGTGCACTGTCCT
GGCCTCAACTTGCTGGATATTTAGCTTTCTCTTGGCTCTGGTCCATATTACTCTTATTCTGA 
GGCTGCCTTTTTGTGGCCCACAAAAGATCAACCACTTTTTCTGTCAAATCATGTCCGTATTC 
AAATTGGCCTGTGCTGACACTAGGCTCAACCAGGTGGTCCTATTTGCGGGTTCTGCGTTCA 
TCTTAGTGGGGCCGCTCTGCCTGGTGCTGGTCTCCTACTTGCACATCCTGGTGGCCATCTTG 
AGGATCCAGTCTGGGGAGGGCCGCAGAAAGGCCTTCTCTACCTGCTCCTCCCACCTCTGCG
TGGTGGGGCTTTTCTTTGGCAGCGCCATTGTCATGTACATGGCCCCCAAGTCAAGCCATTC 
TCAAGAACGGAGGAAGATCCrTTCCCTGTTTTACAGCCTTTTCAACCCGATCCTGAACCCC 
CTCATCTACAGCCITAGGAATGCAGAGGTGAAAGGGGCTCTAAAGAGAGTCCTTTGGAAA 

CAGAGATCAATGTGA (SEQ ID NO: 484). 


AOLFR259 sequences: 

MGDNQSR\TTEFILVGFQLSVEMEVLLFWIFSLLYLFSLLANGMILGLICLDPRLRTPMYFFLSHL 
AVIDIYYASSNLLNMLENLVKHKKTISFISCIMQMALYLTFAAAVCM1LWMSYDRFVAICHPL 
HYTVIMNWRVCTVLAITSWACGFSIALINLILLLRLPFCGPQEVNHFFGEILSVLKIACAD 
35 EffVFAGGVFVLVGPLSLMLlSYMWLLAILKIQSKEGROCAFSTCSSHLCWGLYFG^4AMVVY 
LWDNSQRQKQQK1LTLFYSLFNPLLNPLIYSLRNAQVKGALYRALQKKRTM (SEQ ID NO: 485).

ATGGGGGACAACCAATCACGGGTCACAGAATTCATCCTGGTTGGATTCCAGCTCAGTGTG 
GAGATGGAAGTGCTCCTCTTCTGGATCTTCTCCCTGTTATATCTCTTCAGCCTGCTGGCAAA
TGGCATGATCTTGGGGCTCATCTGTCTGGATCCCAGACTGCGCACCCCCATGTACTTCTTCC 
TGTCACACTTGGCCGTCATTGACATATACTATGCTTCCAGCAATTTGCTCAACATGCTGGA 
AAACCTAGTGAAACACAAAAAAACTATCTCGTTCATCTCTTGCATTATGCAGATGGCTTTG 
TATTTGACTTTrGCTGCTGCAGTGTGCATGATTTTGGTGGTGATGTCCTATGACAGATTTGT 
GGCGATCTGCCATCCCCTGCATTACACTGTCATCATGAACTGGAGAGTGTGCACAGTACTG
GCTATTACTTCCTGGGCATGTGGATTTTCCCTGGCCCTCATAAATCTAATTCTCCTTCTAAG 
GCTGCCCTTCTGTGGGCCCCAGGAGGTGAACCACTTCTTCGGTGAAATTCTGTCTGTCCTC 
AAACTGGCCTGTGCAGACACCTGGATTAATGAAATTrrTGTCTTTGCTGGTGGTGTGTTTG 
TCTTAGTCGGGCCCCTTTCCTTGATGCTGATCTCCTACATGCGCATCCTCTTGGCCATCCTG 
AAGATCCAGTCAAAGGAGGGCCGCAAAAAAGCCTTTTCCACCTGCTCCTCCCACCTCTGTG
TGGTTGGGCTTTACTTTGGCATGGCCATGGTGGTTTACCTGGTCCCAGACAACAGTCAACG 
ACAGAAGCAGCAGAAAATTCrCACCCTGTTTTACAGCCTTTTCAACCCATTGCTGAACCCC 
CTCATCTACAGCCTGCGGAATGCTCAAGTGAAGGGTGCCTTATACAGAGCACTGCAGAAA 

AAGAGGACCATGTGA (SEQ ID NO: 486). 

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ACCATGATTGGTTCTGAGTTCTTCCTCCTGGGCCTCATGGCCTATGACTGCTACGTGGCTGT 
CTGTAACCCTCTGAGATACCCAGTCCTGATGAACCGCAAGAAGTGTCTTTTGCTGGCTGCT 
GGTGCCTGGTTTGGGGGCTCCCTCGATGGCTTTCTGCTCACTCCCATCACCATGAATGTCC 
CrTACTGTGGCTCCCGAAGTATCAACCATTTTTTCTGTGAGATCCCAGCAGTTCTGAAACT 
GGCCTGTGCAGACACGTCCTTGTATGAAACTCTGATGTACATCTGCTGTGTCCTCATGTTG
CTCATCCCCATCTCTATCATCTCCACTTCCTACTCCCTCATCTTGTTAACCATCCACCGCAT 
GCCCTCTGCTGAAGGTCGCAAAAAGGCCTTCACCACTTGTTCCTCCCACTTGACTGTAGTT 
AGCATCTTCTATGGGGCTGCCTTCTACACATACGTGCTGCCCCAGTCCTTCCACACCCCCG 
AGCAGGACAAAGTAGTGTCAGCCTTCTATACCATTGTCACGCCCATGCTTAATCCTCTCAT 
CTACAGCCTCAGAAACAAGGACGTCATAGGGGCATTTAAAAAGGTATTTGCATGTTGCTrA 
TCTGCTCAGAAAGTAGCAACAAGTGATGCTTAG (SEQ ID NO: 478). 

AOLFR255 sequences: 

MEQSNYSWADFILLGLFSNARFPWLLFALIIXWLTSlAS>maaiLmroSRLHTPMYFLLSOLS 

LRDILYISTIVPKMLVDQVMSQRAJSFAGCTAQHFLYLTLAGAEFFLLGLMSYDRYVAJCNPLH 

YPVLMSRKICWLIVAAAWLGGSroGFLLTPVTMQFPFCASREINHFFCEVPALLKLSCTDTSAY 

ETAMYVCCIMMLLIPFSVISGSYTRILITVYRMSEAEGRGKAVATCSSHM 

™P H ^TPEQDK^VSAFYTn.TPMLNPLIYSLRNKDVTGAJLQKWGRCVSS 

ATGGAGCAGAGCAATTATTCCGTGTATGCCGACTTTATCCTTGTGGGTTTGTTCAGCAACG 

CCCGTTTCCCCTGGCTrCTCTrTGCCCTCATTCTCCTGGTCTTTTTGACCTCCATAGCCAGC 

AACGTGGTCAAGATCATTCTCATCCACATAGACTCCCGCCTCCACACCCCCATGTACTTCCT 

GCTCAGCCAGCTCTCCCTCAGGGACATCCTGTATATTTCCACCATTGTGCCCAAAATGCTG 

GTCGACCAGGTGATGAGCCAGAGAGCCATTTCCTTTGCTGGATGCACTGCCCAACACTTCC 

TCTACTTGACCTTAGCAGGGGCTGAGTTCTTCCTCCTAGGACTCATGTCCTATGATCGCTAC 

GTAGCCATCTGCAACCCTCTGCACTATCCTGTCCTCATGAGCCGCAAGATCTGCTGGTTGA 

TTGTGGCGGCAGCCTGGCTGGGAGGGTCTATCGATGGTTTCTTGCTCACCCCCGTCACCAT 

GCAGTTCCCCrrTCTGTGCCTCrCGGGAGATCAACCACTTCTTCTGCGAGGTGCCTGCCCTTC 

TGAAGCTCTCCTGCACGGACACATCAGCCTACGAGACAGCCATGTATGTCTGCTGTATTAT 

GATGCTCCTCATCCCTTTCTCTGTCATCTCGGGCTCTTACACAAGAATTCTCATTACl'GTTT 

ATAGGATGAGCGAGGCAGAGGGGAGGGGAAAGGCTGTGGCCACCTGCTCCTCACACATGG 

TGGTTGTCAGCCTCTTCTATGGGGCTGCCATGTACACATACGTGCTGCCTCATTCTTACCAC 

ACCCCTGAGCAGGACAAAGCTGTATCTGCCTTCTACACCATCCTTACTCCCATGCTCAATC 

CACTCATTTACAGCCTTAGGAACAAGGATGTCACAGGGGCCCTACAGAAGGTTGTGGGGA 

GGTGTGTGTCCTCAGGAAAGGTAACCACTTTCTAA (SEQ ID NO: 480). 

AOLFR256 sequences:

MGGKQPWVTEFILVGFQVGPAI^ILLCGLFSVFYTLTLLGNGVIFGnCLDSKLHTPMYFFLSHL 

AIlDMSYASNhTVFKMLANLNINQKSTISFVnPCIMQTFLYLAFAVTECLILVVMSYDRYV 

QYTVTMSWVCTILASTCWHSFLMADVHI^^ 

VVLYAGSAFTVEGPLCLELVS>nLHILSRHLEDPVMGRAADR^ 

Yf^KSRHPEEQQKVLSLFYSLFNPMLNPLIYSLRNAEVKGALKRVLWKQRSK (SEQ ID NO: 

ATGGGAGGCAAGCAGCCCTGGGTCACAGAATTCATCCTGGTGGGATTCCAGGTTGGTCCA 

GCACTGGCGATTCTCCTCTGTGGACTCTTCTCTGTCTTCTATACACTCACCCTGCTGGGGAA 

TGGGGTCATCTTTGGGATTATCTGCCTGGACrCTAAGCTTCACACACCCATGTACTTCTTCC 

TCTCACACCTGGCCATCATTGACATGTCCTATGCTTCCAACAATGTTCCCAAGATGTTGGC 

AAACCTAATGAACCAGAAAAGCACCATCTCCTTTGTTCCATGCATAATGCAGACTTTTTTG 

TATTTGGCTTTTGCTGTTACAGAGTGCCTGATTTTGGTGGTGATGTCCTATGATAGGTATGT 

GGCCATCTGCCACCCTTTCCAGTACACTGTCATCATGAGCTGGAGAGTGTGCACGATCCTG 

GCCTCAACATGCTGGATAATTAGCTTTCTCATGGCTCTGGTCCATATAACTCATATTCTGAG 

GCCGCCTTTTTGTGGCCCACAAAAGATCAACCACTTTATCTGTCAAATCATGTCCGTATTCA 

AATTGGCCTGTGCTGGCCCTAGGCTCAACCAGGTGGTCCTATATGCGGGTTCTGCGTTCAT 

CGTAGAGGGGCCGCTCTGCCTGGAGCTGGTCTCCAACTTGCACATCCTGTCGCGCCATCTT 
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ATGCGGCTGGCCAACCAGACCCTGGGTGGTGACTTTTTCCTGTTGGGAATCTTCAG^GA 
TCTCACACCCTGGCCGCCTCTGCXrGCITATCnrTCAGTATATTTTTGATGGCTGTGTCTTGG 
AATATTACATTGATACTTCTGATCCACATTGACTCCTCTCTGCATACTCCCATGTACTTCTT 
TATAAACCAGCTCTCACTCATAGACTTGACATATATTTCTGTCACTGTCCCCAAAATGCTG 
5 GTGAACCAGCTGGCCAAAGACAAGACCATCTCGGTCCTTGGGTGTGGCACCCAGATGTAC 
TTCTACCTGCAGTTGGGAGGTGCAGAGTGCTGCCTTCTAGCCGCCATGGCCTATGACCGCT 
ATGTGGCTATCTGCCATCCTCTCCGTTACTCTGTGCTCATGAGCCATAGGGTATGTCTCCTC 
CTGGCATCAGGCTGCTGGTTTGTGGGCTCAGTGGATGGCTTCATGCTCACTCCCATCGCCA 
TGAGCTTCCCCTTCTGCAGATCCCATGAGATTCAGCACTTCTTCTGTGAGGTCCCTGCTGTT 

10 ^GAAGCTCTCTTGCTCAGACACCTCACTTTACAAGATTTTCATGTACTTGTGCTGTGTCAT 
CATGCTCCTGATACCTGTGACGGTCATTTCAGTGTCTTACTACTATATCATCCTCACCATCC 
ATAAGATGAACTCAGTTGAGGGTCGGAAAAAGGCCTTCACCACCTGCTCCTCCCACATTAC 
AGTGGTCAGCCTCTTCTATGGAGCTGCTATTTACAACTACATGCTCCCCAGCTCCTACCAA 
ACTCCTGAGAAAGATATGATGTCATCCTTTTTCTACACTATCCrrTACACCTGTCTTGAATCC 

1 5 TATCATTTACAGTTTCAGGAATAAGGATGTCACAAGGGCTrTGAAAAAAATGCTGAGCGT 
GCAGAAACCTCCATATTAA (SEQ ID NO: 474). 

AOLFR253 sequences: 

MTFFSSGGNCEPVMCSGNQTSQNQTASTDFTLTGLFAESKHAALLYTVTFLLFLMALTGNALL 
70 IT J THSEPRLHTPMYFFISQLALMDLMYLCVTVPKMLVGQVTGDDTISPSGCGIQMFFHLTLAG 

aevflIaa^ 

RKILSFFCETPALLIQ^CSDVSLYKMLTYLCCILMIXTPIMVISSSYTLILHLIHPJvrNSAAGP^KA 
LATCSSHMIIVLLLFGASFYTYMLRSSYHTAEQDMMVSAFYTIFTPVLNPLIYSLRNKDVTRAL 

RSMMQSRMNQEK (SEQ ID NO: 475). 


ATGACrTTTTTTTCCTCAGGGGGAAACTGTGAGCCAGTCATGTGCTCAGGGAATCAGACTr 
CTCAGAATCAAACAGCAAGCACrGATTTCACCCTCACGGGACTCTTTGCTGAGAGCAAGCA 
TGCTGCOCTOCTCTACACCGTGACCTTCCTTCTTTTCT^ 

TCCTCATCCTCCTCATCCACTCAGAGCCCCGCCTCCACACCCCCATGTACTTCTTCATCAGC 
30 CAGCTCGCGCTCATGGATCTCATGTACCTATGCGTGACTGTGCCCAAGATGCrTGTGGGCC 
AGGTCACTGGAGATGATACCATTTCCCCGTCAGGCTGTGGGATCCAGATGTTCTTCCACCT 
GACCCTGGCTGGAGCTGAGGTTTTCCTCCTGGCTGCCATGGCCTATG 

G^GCAGACCTCTCCATTACCCACTGCTGATGAACCAGAGGGTGTGCCAGCTCCTGGTCT 
CAGCCTGCTGGGTTTTGGGAATGGTTGATGGTTTGTTGCTCACCCCCATTACCATGAGCTT 

^5 CCCCrTTTGCCAGTCTAGGAAAATCCTGAGTTTTTTCTGTGAGACTCCTGCCCrrGCTGAAGC 
TCTC^CTGCTCTGACGTCTCCCTCTATAAGATGCTCACGTACCTGTGCTGCATCCTCATGCTT 
CTCACCCCCATCATGGTCATCTCCAGCTCATACACCCTCATCCTGCATCTCATCCACAGGAT 
GAATTCTGCCGCCGGCCGCAGGAAGGCCTTGGCCACCTGCTCCTCCCACATGATCATAGTG 
CTGCTGCTCTTCGGTGCTTCCTTCTACACCTACATGCTCCGGAGTTCCTACCACACAGCTGA 

40 GCAGGACATGATGGTGTCTGCCTTTTACACCATCTTCACTCCTGTGCTGAACCCCCTCAm 
ACAGTCTCCGCAACAAAGATGTCACCAGGGCTCTGAGGAGCATGATGCAGTCAAGAATGA 

ACCAAGAAAAGTAG (SEQ ID NO: 476). 

AOLFR254 sequences: 

45 MTOTSSSDFTLLGLLVNSEAAGIVFTVILAVFLGAVT^ 

^TLFlCTTWKLLADMVSKEKnSFVACGIQIFLYLTMIGSEFFLLGLMAYDCYVAVCNPLRYP 

VLMNP^CLLLAAGAWFGGSLDGFLLTPITMNWYCG 
IMYICC^IMIXmsnSTSYSLILLTIHRMPSAEGRKKAFIT 

FHTPEQDKWSAFYTIVTPMLNPLIYSLRNKDV1GAFKKVFACCSSAQKVATSDA (SEQ ID NO: 477).

ATGACGAACACATCATCCTCTGACTTCACCCTCCTGGGGCTTCTGGTGAACAGTGAGGCTG 

CCGGGATTGTATTTACAGTGATGCT^ 

CATGATATTCTTGATTCAGGTGGACTCT 

55 AGCTGTCCATCATGGACAC 

GGT^CTAAAGAGAAGATCATTTCCTTTGTGGCCTGTGGCATCCAGATCTTC 
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AOLFR250 sequences: 

MENQSSISEFFLRGISAPPEQQQSLFGIFLCMYLVTLTGNLU 

DMGLTSSTVTKMLVN1QTRHHT1SYTGCLTQMYFFLMFGDLDSFFLAAMAYDRYVAICHPLCY 
STVMPJ>QVCALMLALCWVLTNIVALTH^^ 

5 MMVFVLGGTVLIWFLCIVTSYmiWAILRVRTRGGVGKAFSTCSSHLCWCVFYGTLFSAYLC 
PPSIASEEKDIAAAAMYTIVTPMLNPnYSLRNKDMKGALKRLFSHRSIVSS (SEQ ID NO: 469). 

ATGGAAAACCAATCCAGCATTTCTGAATTTTTCCTCCGAGGAATATCAGCGCCTCCAGAGC 
AACAGCAGTCCCTCTTCGGAATTTTCCTGTGTATGTATCTTGTCACCTTGACTGGGAACCTG 

1 0 CTCATCATCCTGGCCATTGGCTCTGACCTGCACCTCCACACCCCCATGTACl 1 nTCTTGGC 
CAACCTGTC ITU GTTG ACATGGGTTTAACGTCCTCCACAGTTACCAAGATGCTGGTGAAT 
ATACAGACTCGGCATCACACCATCTCCTATACGGGTTGCCTCACGCAAATGTATTTCTTTCT 
GATGTTTGGTGATCTAGACAGCTTCTTCCTGGCTGCCATGGCGTATGACCGCTATGTGGCC 
ATTTGCCACCCCCTCTGCTACTCCACAGTCATGAGGCCCCAAGTCTGTGCCCTAATGCTTGC 

1 5 ATTGTGCTGGGTCCTCACCAATATCGTTGCCCTGACTCACACGTTCCTCATGGCTCGGTTGT 
CCTTCTGTGTGACTGGGGAAATTGCTCACTTTTTCTGTGACATCACTCCTGTCCTGAAGCTG 
TCATGTTC TGACA CCCACATCAACGAGATGATGGTTTTTGTCTTGGGAGGCACCGTACTCA 
TCGTCCCCTTTTTATGCATTGTCACCTCCTACATCCACATTGTGCCAGCTATCCTGAGGGTC 
CGAACCCGTGGTGGGGTGGGCAAGGCCTTTTCCACCTGCAGTTCCCACCTCTGCGTTGTTT 

20 GTGTGTTCTATGGGACCCTCTTCAGTGCCTACCTGTGTCCTCCCTCCATTGCCTCTGAAGAG 

AAGGACATTGCAGCAGCTGCAATGTACACCATAGTGACTCCCATGTTGAACCCCTTTATCT 

ATAGCCTAAGGAACAAGGACATGAAGGGGGCCCTAAAGAGGCTCTTCAGTCACAGGAGTA 
TTGTTTCCTCTTAG (SEQ ID NO: 470). 

AOLFR251 sequences:

MEGNKTWITDITLPRFQVGPALEILLCGIJSAFYrLTLLGNGVIFGnCLDCKl«TPMYFFI^HLA 
rVDISYASNYVPKMLTNLMNQESTISFFPCIMQTFLYLAFAHVECLILV\^SYDRYADICHPLRY 
NILMSWRVCTVLAVASWVFSFLLALWLVLILRLPFCGPIJEINHFCEIL^VLKLACADTWLNQV 
VIFAACVFILVGPLCLVLVSYLPJLAAILRIQSGEGPJII<AFSTCSSHLCWGLFFGSAIVTYMAPK 
30 SRHPEEQQKVLSLFfSIJFNPMLNPLIYSLPJNAEVKGALRRAIJUCERLT (SEQ ID NO: 471). 

ATGGAAGGCAACAAGACATGGATCACAGACATCACCTTGCCGCGATTCCAGGTTGGTCCA 

GCACTGGAGATTCTCCTCTGTGGACTTTTCTCrGCCTTCTATACACTCACCCTGCTGGGGAA 

TGGGGTCATCTTTGGGATTATCTGCCTGGACTGTAAGCTTCACACACCCATGTACTTCTTCC 

35 TCTCACACCTGGCCATTGTTGACATATCCTATGCTTCCAACTATGTCCCCAAGATGCTGACG 

AATCTTATGAACCAGGAAAGCACCATCTCCTTTTTTCCATGCATAATGCAGACATTCTTGT 

ATTTGGCTTTTGCrCACGTAGAGTGTCTGATTTTGGTGGTGATGTCCTATGATCGCTATGCG 

GACATCTGCCACCCCTTACGTTACAATATCCTCATGAGCTGGAGAGTGTGCACTGTCCTGG 

CTGTGGCTTCCTGGGTGTTCAGCTTCCTCCTGGCTCTGGTCCCTITAGTTCTCATCCTGAGG 

40 CTGCCCTTCTGCGGGCCTCATGAAATCAACCACTTCTGTGAAATCCTGTCTGTCCTCAAGTT 

GGCCTGTGCTGACACCTGGCTCAACCAGGTGGTCATCTTTGCAGCCTGCGTGTTCATCCTG 

GTGGGGCCACTCTGGCTGGTGeTGGTCTCCTACTTGCGCATCCTGGCCGCCATCTTGAGGA 

TCCAGTCTGGGGAGGGCCGCAGAAAGGCCTTCTCCACCTGCTCCTCCCACCTrTGCGTGGT 

, GGGACTCITCTTTGGCAGCGCCATTGTCACGTACATGGCCCCCAAGTCCCGCCATCCTGAG 

45 GAGCAGCAGAAAGTTCTTTCCCTGTTTTACAGCCTTTTCAATCCAATGCTGAACCCCCTGA 

TATATAGCCTAAGGAATGCAGAGGTCAAGGGCGCCCTGAGGAGGGCACTGAGGAAGGAG 
AGGCTGACGTGA (SEQ ID NO: 472) 



AOLFR252 sequences:

50 MRLANQTLGGDFFLLGIFSQISHPGRLCLLffSIFLMAVSWNITLnLLIHroSSLHTPMYFFINQLSL 
IDLTYISVTVPKMLVNQLAIO)KTISVLGCGTQMYFYLQLGGAE^ 

YSVLMSHRVCLLIA.SGCWFVGSVDGFMLTPIAMSFPFCRSHEIQHFFCEWAVLKLSCSDTSLY 
KIFMYLCCNHMLLIPVTVISVSYYYTILTIHKMNSVEGRKKAFr^ 

PSSYQTPEKDMMSSFFYTILTPVLNPirYSFPJsrKDVTP^KKMI^V (SEQ ID NO: 473) 
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GATACTGATAAAATGGCTTCTGTGTTTTACACTTTAGTAATCCCCATGCnTAACCCTTTGAT 
TTACAGCTTAAGAAACGAAGAGGTGAAAAATGCCTTCTATAAGCTCTTTGAGAATTGA 

(SEQ ID NO: 464). 

AOLFR248 sequences:

MPCMPCALPTGGLLPHPQHTMMEIANVSSPEVFVLLGFSTRPSLETVLFIVVLSFYMVSILGNGI 
IILVSHTDVHLHTPMYFFLANLPFLDMSFTTSIVPQLLANLWGPQKTISYGGCVVQFYISHWLG 
ATECVLLATMSYDRYAAICRPLHYTVIMHPQLCLGLALASWLGGLTTSMVGSTLTMLLPLCG 
NNCroHFFCEMPLIMQLACVDTSLNEMEMYLASFVFVVLPLGLILVSYGHIARAVLKIRSAEGR 
1 0 RKAFNTCSSHVAVVSLFYGS1IFMYLQPAKSTSHEQGKF1ALFYT\^TPALNPL1YTLRNTE 
ALRHMVLENCCGSAGKLAQI (SEQ ID NO: 465). 

ATGCCCTGTATGCCCTGTGCTCTTCCCACAGGTGGCCrrTTTGCCCCACCCCCAGCATACAAT 
GATGGAAATAGCCAATGTGAGTTCTCCAGAAGTCTTTGTCCTCCTGGGCTTCTCCACACGA 

1 5 CCCTCACTAGAAACTGTCCTCTTCATAGTTGTCTTGAGTTTTTACATGGTATCGATCTTGGG 
CAATGGCATCATCATTCTGGTCTCCCATACAGATGTGCACCTCCACACACCTATGTACTTCT 
TTCTTGCCAACCTCCCCTTCCTGGACATGAGCTTCACCACGAGCATTGTCCCACAGCTCCTG 
GCTAACCTCTGGGGACCACAGAAAACCATAAGCTATGGAGGGTGTGTGGTCCAGTTCTAT 
ATCTCCCATTGGCTGGGGGCAACCGAGTGTGTCCTGCTGGCCACCATGTCCTATGACCGCT 

20 ACGCTGCCATCTGCAGGCCACTCCATTACACTGTCATTATGCATCCACAGCTTTGCCTTGG 
GCTAGCTTTGGCCTCCTGGCTGGGGGGTCTGACCACCAGCATGGTGGGCTCCACGCTCACC 
ATGCTCCTACCGCTGTGTGGGAACAATTGCATCGACCACTTCTTTTGCGAGATGCCCCTCA 
TTATGCAACTGGCTTGTGTGGATACCAGCCTCAATGAGATGGAGATGTACCTGGCCAGCTT 
TGTCTTTGTTGTCCTGCCTCTGGGGCTCATCCTGGTCTCTTACGGCCACATTGCCCGGGCCG 

25 TGTTGAAGATCAGGTCAGCAGAAGGGCGGAGAAAGGCATTCAACACCTGTTCTTCCCACG 
TGGCTGTGGTGTCTCTGTTTTACGGGAGCATCATCTTCATGTATCTCCAGCCAGCCAAGAG 
CACCTCCCATGAGCAGGGCAAGTTCATAGCTCTGTTCTACACCGTAGTCACTCCTGCGCTG 
AACCCACTTATTTACACCCTGAGGAACACGGAGGTGAAGAGCGCCCTCCGGCACATGGTA 
TTAGAGAACTGCTGTGGCTCTGCAGGCAAGCTGGCGCAAATTTAG (SEQ ID NO: 466). 


AOLFR249 sequences: 

MKSQmKSDLKYRAILLQKVTRMFLLFWVLLLVl^RLLVVMGRGNSTEVTEFHLLGFGVQHEF 
QHVLFIVLLLryVTSLIGMGMELLIKTDSRLQTPMYFFPQHLAFVDIGYTSAITPKMLQSFTEEN 
NIJTFRGCVIQFLVYATFATSDCYLLAIMAMDCYVAICKPLRYPM]MSQTVYIQLVAGSYIIGSI 
3 5 NASVHTGFTFSLSFCKSNKINHFFCDGLPILALSCSNroiNIILDVVFVGFDLMFTELVHFSYn'IM 
VTILK1VISSTAGRKKSFSTCASHLTAVTIFYGTLSYMYLQPQSNNSQENMKVASIFYGTVIPMLN 

PLIYSLRNKEGK (SEQ ID NO: 467). 

ATGAAAAGTCAAATTGAAAAAAGTGACTTAAAATATAGAGCCATTTTATTGCAAAAAGTC 

40 ACAAGGATGTTCCrGCTTTTCTGGGTCCTTCTCTTGGTCCTTTCTAGACTTI^ 

GGGTCGAGGAAACAGCACTGAAGTGACTGAATTCCATCTTCTGGGATTTGGTGTCCAACAC 
GAATTrCAGCATGTCCTTTTCATTGTACTTCTTCTTATCTATGTGACCTCCCTGATAGGAAA 
TATTGGAATGATCITACrCATCAAGACCGATTCCAGACTTCAAACACCCATGTACrTTTTTC 
CACAACATTTGGCTTTTGTTGATATCTGTTATACTTCTGCTATCACTCCCAAGATGCTCCAA 

45 AGCITCACAGAAGAAAATAATTTGATAACATTTCGGGGCTGTGTGATACAATTCTTAGTTT 
ATGCAACATTTGCAACCAGTGACTGTTACCTCCTAGCTATTATGGCAATGGATTGTTATGT 
TGCCATCTGTAAGCCCCTTCGCTATOCCATGATCATGTCCCAAACAGTCTACATCCAACTCG 
TAGCTGGCTCATATATTATAGGCTCAATAAATGCCrCrGTACATACAGGTTTTACATTTTCA 
CTGTCCTTCTGCAAGTCTAATAAAATCAATCACITTTTCTGTGATGGTCTCCCAATTCTTGC 

50 CCTTTCATGCrCCAACATTGACATCAACATCATTCrAGATGTTGTCTTTGTGGGATrTGACT 
TGATGTrCACTGAGTTGGTCATCATCTTTTCCTACATCTACATTATGGTCACCATCCTGAAG 
ATGTCTTCTACTGCTGGGAGGAAAAAATCCTTCTCCACATGTGCCTCCCACCTGACAGCAG 
TAACCATTTTCTATGGGACACTCTCTTACATGTACTTACAGCCTCAGTCTAATAATTCTCAG 
GAGAATATGAAAGTAGCCTCTATATTTTATGGCACrGTTATTCCCATGTTGAATCCTTTAAT 

CTATAGCTTGAGAAATAAGGAAGGAAAATAA (SEQ ID NO: 468).
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GTAGCCATATGTAAACCCCTGCACTATAGGACAATCATGAGCCACAAGCTGCTAAAGGGG 
TTTGCGATACTrTCATGGATAATTGGTTTTTTACACTCCATAAGCCAGATAGTTTTAACAAT 



GAACTTGCCTTTCTGTGGCCACAATGTCATAAACAACATATTTTGTGATCTTCCCCTTGTGA 
TCAAGCTTGCTTGCATTGAAACATACACCCTGGAATTATTTGTCATTGCTGACAGCGGGCT 
GCTCTCTTTCACCTGTTTCATCCTCTTGCTTGTTTCTTACATTGTCATCCTGGTCAGTGTACC 
AAAAAAATCATCACATGGGCTCTCCAAGGCGCTGTCCACATTGTCTGCCCACATCATTGTG 

CAATAAAACTCTrGCCGTATTTTATACAGTrATCACACCCTTACTGAATCCGAGTATITATA 

CCCTGAGAAATAAGAAAATGCAAGAGGCCATAAGAAAATTACGGTTCCAATATGTTAGTT 
CTGCACAGAATTTCTAG (SEQ ID NO: 460).



AOLFR246 sequences: 

MSPENQSSVSEFLLLGLPIRPEQQAVFFTLFLGMYLTTVLG^ 

LALTDISFSSVTWKMLMDMRTKYKSILYEEC1SQMYFF1FFTDLDSFUTSMAYDRYVAICHPL 
^VIMREELCWLVAVSWILSCASSLSHTLLLTRLSFCAANTEPHVFCDLAALLKLSCSDIFLNE 
LVMFTVGVWITLPFMCILVSYGYIGATILRWSTKGIHKALSTCGSHLSVVSLYYGSIFGOYLF 
P^SSSIDKDVIVALMYTVVTPMLNfPFIYSLRNRDMKEALGKLFSRATFFSW (SEQ ID NO: 

ATGAGCCCTGAGAACCAGAGCAGCGTGTCCGAGTTCCTCCTTCTGGGCCTCCCCATCCGGC 
CAGAGCAGCAGGCTGTGTTCTTCACCCTGTTCCTGGGCATGTACCTGACCACGGTGCTGGG 
GAACCTGCTCATCATGCTGCTCATCCAGCTGGACTCTCACCTTCACACCCCCATGTACTTCT 

T^ AGCCAOTGGCTCTCACTGACATCTCClmc ^^ 

ATGGACATGCGGACTAAGTACAAATCGATCCTCTATGAGGAATGCATTTCTCAGATGTATT 
TlTTTATAtTTTTTACTC^ 

GTTGCCATATGTCACCCTCTCCACTACACTGTCATCATGAGGGAAGAGCTCTGTGTCTTCTT 

AGTGGCTGTATCITGGATTCTGTCTTGTGCCAGCTCCCTCTCTCACACCCTTCTCCTGACCC 

G ? C JS^ CmCTGTGCTGCGAACACCATCCCCCATGTOT 

TCATTACCCTGCCATTCATGTGTATCCTGGTATCATATGGCTACATTGGGGCCACCATCCTG 

AGGGTCCCTTCAACCAAAGGGATCCACAAAGCATTGTCCACATGTGGCTCCCATCTCTCTG 

TGGTGTCTCTCTATTATGGGTCAATATTTGGCCAGTACCTTTTCCCGACTGTAAGCAGTTCT 

ATTGACAAGGATGTCATTGTGGCTCTCATGTACACGGTGGTCACACCCATGTTGAACCCCT 

TTATCTAGA iP^ AGGAACA ^ 

CAACATTTTTCTCCTTGGTGACATCTGACTTTTTAAAAAATTAG (SEQ ID NO: 462). 
AOLFR247 sequences: 

MGQHNLTVLTEFILMELTRRPELQIPLFGVFLW 

ASVDLGNSTVICPKVLANFVVDRNTISYYACAAQLAFFLMFnSEFFILSAMAYDRYVAICNPLL 
YYVIMSQRECHVLVGIQYLYSTFQALMFTIKIFTLTFCGSNmSHFYCDDVPLLPMLCSNAOEIE 
LL^ILFSVFNLISSFLIVLVSYMLILljyCQMHSAEGRKKAFSTCGSHLTVVVVFyGS 
PNSTHFFDTDKMASVFYTLVIPMLNPLrySLRNEEVKNAFYKLFEN (SEQ ID NO: 463). 

ATGGGCCAACACAATCTAACAGTGCTAACTGAATTCATTCTGATGGAACTCACAAGGCGGC 

CTGA iF rGCAGATTCCCC ^^ 

AACCTAACTATGATCATTTTGACCAAACTGGACTCCCACTTACATACACCTATGTACTTTTC 

TATCAGACATTTGGCTTCTGTTGATCTTGGTAATTCTACTGTCATTTGTCCCAAGGTGCT^ 

CAAATTTTGTTGTGGATCGAAATACTATTTCCTATTATGCATGTGCTGCACAGCTGGCATTC 

^COTATGTTCATTATCAGTGAATTTTTCATCCTGTCAGCCATGGCCTATGACCGCTATGT 

GGCCATTTGTAACCCTCTGCTCTATTATGTTATTATGTCTCAGCGACTGTGTCATGTACTGG 

TCGGCATTCAATATCTCTACAGCACATITCAGGCTCTGATGTrCACTATTAAGAlXrrTACA 

TTGACCTTCTGTGGCTCTAATGTCATCAGTCATTTTTACTGTGATGATGTTCCTTTGCTACC 

TATGCTTTGCTCAAATGCACAGGAAATAGAATTGTTGAGCATACTATTTTCTGTATTTAATT 

TGATCTCCTCCTTTCTGATAGTCITAGTGTCCTACATGTTGATTITGTTAGCTATATGTCAA 

ATGCATTCTGCAGAGGGCAGGAAAAAGGCTTTCTCCACATGTGGTTCCCATTTGACAGTGG 

TGGTTGTGTTCTATGGGTCTCTACTCTTCATGTACATGCAGCCCAATTCCACTCACTTCTTT 






PCT/US01/20122 

WO 01/98526 

AATGCAATCATCATXTCCACCATTGTGCTGGACAGAGCCCTTCATACTCCCATGTACTTCTT 
CCTTGCCATCCTTTCTTGCTCTG 

T?GACCTGCTGTCCCAGAAGAAGACCATITCTTrCCTGGGCTGTGCCA 

GGCCATCTGTAACCCACTGCGCTACTCAGTGCTCATGGGACATGGGGTGTGTATGGGACTA
ATGGCTGCTGCCTGTGCCTGTGGCTTCACTGTCTCCCTGGTCACCACCTCCCTAGTATTTCA 
TCTGCCCXTCCACTCCTCCAACCAGCTCCATCACTTCTTCTGTGACATCTCCCCTGTCC^A 
AACTGGCATCTCAGCACTCCGGCTTCAGTCAGCTGGTCATATTCATGCTTGGTGTAT^GC 
C^GGTCATTCCTCTGCTACTTATCCTAGTCTCCTACATCCGCATCATCTCTGCCATTCTAA 
AAATCCCTTCCTCCGTTGGAAGATACAAGACCnTCTCCACCTGTGCCTCCCATCTCATrGTG

G^CTGTTCACTACAGTTC^ 

Sgccaagacaccctaatatctgtgtcatacaccatccttacccca^ 

A^A^GTCTGAG^ 

TTCTATCCTCTTAGTTAA (SEQ ID NO: 456) 


AOLFR244 sequences: 

MWQEYYFLNWFPLLKVCCLTINSHVVILLPWECYHLIWKILPYIGTTVGSMEEYNTSSTO 

MGSRKETSGLIFAlISIIH^ALMANGVMIFLIQTDLPXHTPMYFLLSHLSLro 

L^WRTISFVGCTAQHFLYLTLVGAEFFLLGLM 

20 ™S\WGGSLD^ 

ffF7^S™LTTVQCMSSVEGPJCKAFATCSSH^ 

DKVLS WlLTPMLNPLIYSLRNKDVTGALKRALGRFKGPQRVSGGVF (SEQ ID NO: 457). 

ATGTGGCAAGAATACTATTTTTTAAATGTTTTCTTCCCACTTTTAAAAGTTTGCT 
AATT^T^CACATGTTGTTATTTTACTG^

^cctVatatcg^gcacaactgtaggatcaatggaagagtacaacacat 

cicScATGGGGCTGTTCAACAGAAAGGAAACCTCAGGTCTTATTTT^ 

ATCATOTCTTCACCGCACTG^^ 
TGCGCCTTCATACACCCATGTACT^ 
ImCCACTATTGTGCCTAAGATGCTGGTTAATTACCTGCTGGATCAAAGGACC

TGTGGGGTGCACAGCTCAACACTTCCTCTACCITACCCT^ 
CTGGGCCTCATGGCCTATGACCGCTATGTGGCCATTTGCAACCCTCTGAGATA 

tca?gJgccgccgggtctgttggatga™ 
tcgc^cctcSaacccccatcaccatga 

AC^CTTCTGTGAGGCACCAGCAGTCCTGAAGTTGGCATGTGCAGACACAGCCCrCTACGA
GA^AGTGATGTATGTGTGCTGTGTTTTGATGCT^ 

C^ATCCCCGAATCCTGACTACAGTTCAGTGCATGAGCTCAGTGGAGGGCAGGAAGAAGG 
CATTTGCCACTTGCrcATCCCACATGACTGTGGTGTCCTT 
AC™ATGCTGCCACAirCTTACCACAAGCCAGCCCA 
ACACCA^CTCACACCCATGCTGAACCCCCTCATCTACAGCCITAGAAACAAGGATGTC

TGGAGCTCTG^AGAGGGCCTTGGGGAGGTTCAAGGGTCCTCAAAGGGTGTCAGGAGGTGT 

CTTTTGA (SEQ ID NO: 458). 






AOLFR245 sequences: 

MDLKNGSLVTEFILLGFFGRWELQIFFFVTFSLIYGATVMGNILI^^ 

L^LDMCLSTATTPKMITOLLTDHKTISVWGCV 

^.HY^TmOTKXLKGFAILSWIIGFLHSISQIVLT^ 

ASNKTLAVFYTVITPLLNPSIYTLRNKKMQEAIRKLRFQYVSSAQNF (SEQ ID NO: 459). 

ATnoATCTTAAAAATGGATCTCTAGTGACCGAGTTTATTTTACTAGGATrTTTTGGACGAT 
GGGAACTT^AAATTTTC^XTTrTGTGA 

CCTTGGAAATCTCTCTTTTTTGGACATGTGTCTCT 

TA^^TTTGCTCACTGACCACAAGACCATCTCTGTGTGGGGCTGCGTGACCCAGATGTTCTT 
CA^GCAGTTCTTreGGGGTGCTGAGATGACTCTTCT 
LGVMGYDRYAAICHPLHYPTT,MSWQVCGKl^^CAIGGFlASLTVV>n.WSLPFCSANKVNH
YFCDISA VILLA CTNTDVNEFVTFICGVLVLVVPFLFICVSYLCILRTILKIPSAEGRRKAFSTCAS 

SgSLKLW G (SEQ F ^^45^ 

ATGCCCCAAATTCTTATATTCACATACCTGAATATGTTTTACTTCTTTCCCCC^ 

CTTGGCAGAAAACCTCACCATGGTCACCGAATTCCTGTTGCTGGGTTTTTCCAGCCTTGGT 

GAAATTCAGCTGGCCCTCTTTGTAGTTTTTCTTTTTCTGTATCTAGTCATTCTTAGTGGCAA 

TGTCACCATTATCAGTGTCATCCACCTGGATAAAAGCCTCCACACACCAATGTACTTCTTCC 

TTGGCATTCTCTCAACATCTGAGACCTTCTACACCTTTGTCATTCTACCCAAGATGCTCATC 

AATCTACTTTCTGTGGCCAGGACAATCTCCTTCAACTGTTGTGCTCTTCAAATGTTCTTCTr 

CCTTGGTTTTGCCATTACCAACTGCCTGCTATTGGGTGTGATGGGTTATGATCGCTATGCTG 

CCATTTGTCACCCTCTGCATTACCCCACrCTTATGAGCTGGCAGGTGTGTGGAAAACTGGC 

AGCTGCCTGTGCAATTGGTGGCTTCTTGGCCTCTCTTACAGTAGTAAATTTAGTTTTCAGCC 

TCCCTTTTTGTAGCGCCAACAAAGTCAATCATTACITCTGTGACATCTCAGCAGTCATTCTT 

CTGGCITGTACCAACACAGATGTTAACGAATTTGTGATATTCATTTGTGGAGTTCTTGTAC 

TTGTGGTTCCCTTTCTGTTTATCTGTGTTTCTTATCTCTGCATTCTGAGGACTATCCTGAAG 

ATTCCCTCAGCTGAGGGCAGACGGAAAGCGTTTTCCACCTGCGCCTCTCACCTCAGTGTTG 

TTATTGTTCATTATGGCTGTGCTTCCTTCATCTACCTGAGGCCTACAGCAAACTATGTGTCC 

AACAAAGACAGGCTGGTGACGGTGACATACACGATTGTCACTCCATTACTAAACCCCATG 

GTTTATAGCCTCAGAAACAAGGATGTCCAACTTGCTATCAGAAAAGTGTTGGGCAAGAAA 
GGTTCTCTAAAACTATATAATTGA (SEQ ID NO: 452). 

AOLFR242 sequences: 

MOTTLFHPYSFLLLGIPGLESMHLWVGFPFFAWLTAVLGMTlLFVlQTDSSLHHPMFy^ 
SIDPGLSTSTIPKMLGTFWFTLREISFEGCLTQMFFIHLCTGMESAVLVAMAYDCYVAJCDPLCY 
IXVLTOKWSVMALAlFLRPLWVIPFVLFILRLPFCGHQIIPHTYGEHMGIARLSCASm 
LCAISILVFDIIAIVISYVQILCAVFLLSSHDARLKAFSTCGSHVCVMLTFYMPAFFSFMTHRFGR 
NIPHFmiLLAWYVVIPPALNSVTYGVRTKQIRAQVLKMFFNK (SEQ ID NO: 453). 

ATGAATACCACTCTATTTCATCCTTACTCTTTCCTTCTTCrGGGAATTCCT 

TATGCATCTCTGGGTTGGTTTTCCTTTCTTTGCTGTGTTCCTGACAGCTGTCCTTGGGAATA 

TCACCATCCTTTTTGTGATTCAGACTGACAGTAGTCTCCATCATCCCATGTTCTACTTCCTG 

GCCATrCTGTCATCTATTGACCCGGGCCTGTCTACATCCACCATCCCTAAAATGCrTGGCAC 

CTTCTGGTTTACCCTGAGAGAAATCTCCTTTGAAGGATGCCTTACCCAGATGTTCTTCATCC 

ACCTGTGCACTGGCATGGAATCAGCTGTGCTTGTGGCCATGGCCTATGATTGCTATGTGGC 

CATCTGTGACCCTCTTTGCTACACGTTGGTGCTGACAAACAAGGTGGTGTCAGTTATGGCA 

CTGGCCATCTTTCTGAGACCCTTAGTCTTTGTCATACCCTTTGTTCTATTTATCCTAAGGCT 

TCCATTTTGTGGACACCAAATTATTCCTCATACTTATGGTGAGCACATGGGCATTGCCCGC 

CTGTCTTGTGCCAGCATCAGGGTTAACATCATCTATGGCTTATGTGCCATCTCTATCCTGGT 

CTTTGACATCATAGCAATTGTCATTTCCTATGTACAGATCCTTTGTGCTGTATTTCTACTCT 

CTTCACATGATGCACGACTCAAGGCATTCAGCACCTGTGGCTCTCATGTGTGTGTCATGTT 

GACTTTCTATATGCCTGCATTTTTCTCATTCATGACCCATAGGTTTGGTCGGAATATACCTC 

ACTTTATCCACATTCTTCTGGCTAA1TTCTATGTAGTCATTCCACCTGCTCTCAACTCTGTA 

ATTTATGGTGTCAGAACCAAACAGATTAGAGCACAAGTGCTGAAAATGTTTTTCAATAAAT 
AA (SEQ ID NO: 454). 



AOLFR243 sequences: 

MEQVhJKTVVREFVVLGFSSLAJ^QQLLFVIFLLLYLFTLGTNAmSTIVLDP^H 
SCSEICYTFVIVPKMLVDIXSQKKTIS^ 

YSVmGHGVCMGIJVLAAACACGFIVSLVTTSLVFHU'FHSSNQLHH^ 

SQLVffMLGWALVIPLLLILVSYIRnSAILICIPSSVGRYKTFSTCASHLrN^TVHYSCASFIYLRPK 
TNYTSSQDTLISVSYmTPLFWMT/SLRNKEFKSALRRTIGQTFYPLS (SEQ ID NO: 455). 

ATGGAGCAAGTCAATAAGACTGTGGTGAGAGAGTTCGTCGTCCTCGGCTTCTCATCCCTGG 
CCAGGCTGCAGCAGCTGCTCTTTGTTATCTTCCTGCTCCTCTACCTGTTCACTCTGGGCACC 
GCAGTCACGGTTTTCTATGGGACAATGCTATTTATGTATTTGCAGCCCCAAACCAACCACT
CACTGGATACTGATAAGATGGCTTCTGTGTTTTACACATTGGTGATTCCTATGCTGAATCC 
CTTGATCTACAGCCTGAGGAATAATGATGTAAATGTTGCCTTAAAGAAATTCATGGAAAAT 

CCATGTTACTCCTTTAAATCAATGTAA (SEQ ID NO: 446). 






AOLFR239 sequences: 

MDPQNYSLVSEFVLHGLCTSIU1LQNFFFIFFFGVYVAIMLGNLLILVTVISDPCLHSSPMYFLLG 
NLAFLDMWLASFATPKN11RDFLSDQKLISFGGCMAQIFFLHFTGGAEMVLLVSMAYDRYVA1C 
KPLHYMTLMSWQTCIFJ-VLASWVVGFVHSISQVAFIVNLPYCGPNE\nDSFFCDLPLVIKLACM 
DTYVLGIIMISDSGLLSLSCFLLLLISYTVILLAIRQRAAGSTSKALSTCSAHIMVVTLFFGPCIFV
YNOIPFSI^SVDKLLSWYTIFTPLLNPIIYTLRNEEMKAAMKKLQNRRVTFQ (SEQ ID NO: 447). 



ATGGACCCACAGAACTATTCCTTGGTGTCAGAATTTGTGTTGCATGGACTCTGCACTTCAC 
GACATCTTCAAAATTTTTTCTTTATATTTTTCTTTGGGGTCTATGTGGC 

AACCTTCTCATTTTGGTCACTGTAATTTCT^

CCTGCTGGGGAACCTAGCTTTCCTGGACATGTGGCTGGCCTCATTTGCCACTCCCAAGATG 

ATCAGGGATTTCCTTAGTGATCAAAAACTCATCTCCrTTGGAGGATGTATGGCTCAAATCT 

TCTTCTTGCACTTTACTGGTGGGGCTGAGATGGTGCTCCTGGTTTCCATGGCCTATGACAG 

ATATGTGGCCATATGCAAACCCTTGCATTACATGACTTTGATGAGTTGGCAGACTTGCATC 

AGGCTGGTGCTGGCTTCATGGGTCGTTGGATTTGTGCACTCCATCAGTCAAGTGGCTTTCA
CTGTAAATTTGCCTTACTGTGGCCCCAATGAGGTAGACAGCTTCTTCTGTGACCTCCCT 
GTGATCAAACTTGCCTGCATGGACACCTATGTCTTGGGTATAATTATGATCTCAGACAGTG 
GGTOCTTTCCTTGAGCTGirrTCTGCTCCTCCTGATCTCCTACACCGTGATCCTCCTCGCT 
ATCAGACAGCGTGCTGCCGGTAGCACATCCAAAGCACTCTCCACTTGCTCTGCACATATCA 

TGGTAGTGACGCTGTTCTTTGGCCCTTGCATTTTTGTTTATGTGCGGCCTrTCAGTAGGTTC
TCTGTGGACAAGCTGCTGTCTGTGTTTTATACCATTTTTACTCCACTCCTGAACCCCATTAT 
CTACACATTGAGAAATGAGGAGATGAAAGCAGCTATGAAGAAACTGCAAAACCGACGGGT 

GACTTTTCAATGA (SEQ ID NO: 448). 

AOLFR240 sequences:

MAGENHTTLPEFLLLGFSDLKALQGPLFWVVLLVYLVTLLGNSLnLLTQVSPALHSPMYFFLR 
OLSVVELFYTTDIWRTLAOT.GSPHPQAISFQGCAAQMYVFIVLGISECCLLTAMAYDRYVAIC 

OPLRYSTLLSPP^CLAMVGSSWLTGIITATTHAS^ 

SEISVMTATrVFIMIPFSLrVTSYTRn^ 
AGSSVTTDRVLSLFYTVITPMLNPiryTLRNKDVRRALRHLVKRQRPSP (SEQ ID NO: 449).

ATGGCTGGGGAAAACCATACTACACTGCCTGAATTCCTCCTTCTGGGATTCTCTGACCTCA 
AGGCCCTGCAGGGCCCCCTGTTCTGGGTGGTGCTTCTGGTCTACCTGGTCACOTGC^ 
TAACTCCCTGATCATCCTCCTCACACAGGTCAGCCCTGCCCTGCACTCCCCCATGTACTTCT 
TCCrGCGCCAACTCTCAGTGGTGGAGCTCTTCTACACCACTGACATCGTGCCCAGGACCCT
GGCCAATCTGGGCTCCCCGCATCCCCAGGCCATCTCTTTCCAGGGCTGTGCAGCCCAGATG 
TACGTCTTCATTGTCCTGGGCATCTCGGAGTGCTGCCTGCTCACGGCCATGGCCTATGACC 
GATATGTTGCCATCTGCCAGCCCCTACGCTATTCCACCCTCrTGAGCCCACGGGCCTGCTT 
GGCCATGGTGGGGTCCTCCTGGCTCACAGGCATCATCACGGCCACCACCCATGCCTCCCTC 

ATCTTCTCTCTACCTTTTCGCAGCCACCCGATCATC

AGTACTGAGGCTGGCAAGTGCTGGGAAGCACAGGAGCGAGATCTCCGTGATGACAGCCAC 

CATAGTCTTCATTATGATCCCCTTCTCTCTGATTGTCACCTCTTACATCCGCATCCTGGGTG 

CCATCCTAGCAATGGCCTCCACCCAGAGCCGCCGCAAGGTCTTCTCCACCTGCTCCTCCCA 

TCTGCTCGTGGTCTCTCTCTTCrTTGGAACAGCCAGCATCACCTACATCCGGCCGCAGGCA 

GGCTCCTCTGTTACCACAGACCGCGTCCTCAGTCTCTTCTACACAGTCATCACACGCA

CAACCCCATCATCTACACCCrTCGGAACAAGGACGTGAGGAGGGCCCTGCGACACTTGGT 

GAAGAGGCAGCGCCCCTCACCCTGA (SEQ ID NO: 450).

AOLFR241 sequences: 
MPOMFTYENMFYFFPPLQILAENLTMVTEFLLLGFSSLGEI^
" VTIttDKSLHTPMYFFLGIL^TSETFYTFVILPKMLINLLSVARTISFNCCALQ^ 
CCCCTGCACTACCCAGTGGCCATGAACAGAAGGATGTGTGCAGAAATGGCTGGAATCACC

TGGGCCATAGGTGCCACGCACGCrGCAATCCACACCTCCCTCACCTTCCGCCTGCTCTACT 

GTGGGCCTTGCCACATTGCCTACTTCTTCTGCGACATACCCCCTGTCCTAAAGCTCGCCTGT 

ACAGACACCACCATTAATGAGCTAGTCATGCTTGCCAGCATTGGCATCGTGGCTGCAGGrT 

GCCTCATCCTCATCGTTATTTCCTACATCTTCATCGTGGCAGCTGTGTTGCGCATCCG^CACA

GCCCAGGGCCGGCAGCGGGCCTTCTCCCCCTGCACTGCCCAGCTCACTGGGGTGCTCCTGT 

^^^ CACCTGTCTGTATCTA CCTGCAGCCTCGCTCCAGTGAGGCAGGAGCTGGGGC 

CCCTGCTGTCTTCTACACAATCGTAACTCCAATGCTCAACCCATTCATTTACACTTTGCGGA 

ACAAGGAGGTGAAGCATGCTCrGCAAAGGCnTTTGTGCAGCAGCTTCCGAGAGTCTACAG 
CAGGCAGCCCACCCCCATAG (SEQ ID NO: 442).

AOLFR237 sequences: 

MDQRNYTRVKEFITLGITQSREI^QVLF^ 

RNI^n.DICFSSITAPKVLIDLLSETKTISFSGCVTQMFFFHLLGGADVFSLSVMAFDRYlAISKPL 
HYMTIMSRGRCTGLIVGFLGGGLVHSIAQISLLLPLPVCGPNVLDTFYCDVPQVIJCLAC^ 
LELLMISNNGLVSWFVFITLLISYTVILMMLRSHTGEGRRKAISTCTSHITWTLHFVPCrYW 
MFTAU>TOTAISVTFTVISPLLNPIIYTLP^QEMKXAMRKL (SEQ ID NO:

ATGGATCAGAGAAATTACACCAGAGTGAAAGAATTTACCTTCCTGGGAATTACTCAGTCCC
GAGA ^^ GCCAGGTOTA ^^ 

AAACTTCCTCATCATGGTTACAGTTACCTGTGAATCTCACCTTCATACGCCCATGTACTTCC 

TGCTCCGCAACCTGTCTATTCTTGACATCTGCTTTTCCTCCATCACAGCTCCTAAGGTCCrrG 

ATAGATCITCTATCAGAGACAAAAACCATCTCCTTCAGTGGCTGTGTCACTCAAATGTTCT 

TCTTCCACCTTCTGGGGGGAGCAGACGTTTTTTCTCTCTCTGTGATGGCGTTTGACCGCTAT

ATAGCCATCTCCAAGCCCCTGCACTATATGACCATCATGAGTAGGGGGCGATGCACAGGCC 

TCATCGTGGGCTTCCTGGGTGGGGGGCTTGTCCACTCCATAGCGCAGATTTCTCTATTGCT 

CCCACTCCCTGTCrGTGGACCCAATGTTCTTGACACTTTCTACTGCGATGTCCCCCAGGTCC 

TCAAACTTGCCrGCACTGACACCTTCACTCTGGAGCTCCTGATGATrTCAAATAATGGG-TT

AGTCAGTTGGTTrGTATTCTTCTTTCTCCTCATATCTTACACGGTCATCTTGATGATGCTGA

GGTCTCACACTGGGGAAGGCAGGAGGAAAGCCATCTCCACCTGCACCTCCCACATCACCG 

TGGTGACCCTGCATTTCGTGCCCTGCATCTATGTCTATGCCCGGCCCTTCACTGCCCTCCCC 

ACAGACACTGCCATCTCTGTCACCITCACTGTCATCTCCCCTTTGCTCAATCCTATAATTTA 

CAGGCTGAGGAATCAGGAAATGAAGTTGGCCATGAGGAAACTGAAGAGACGGCTAGGAC
AATCAGAAAGGATTTTAATTCAATAA (SEQ ID NO: 444).



AOLFR238 sequences: 

MAPENFTRVTEFILTGVSSCPELQIPLFEYFLVLYVLTMAGNLGnTLTSVDSRLO 

AIINLGNSTVIAPKMLMNFLVKIGCTTSFYECATQLGGFLFFIVSEVMMLAVMAYDRYVAICN^ 

LLYMVVVSRRLCLLLVSLTYLYGFSTAR^SPCIFSVSYCSSNnNHFYCDIAPLLALSCSDTYIPE

TIWISAATOLFFSMITVLVSYFNnVI^ILRmSPEGPJKJCAFSTCASHMIAVTVFYGm 

QT^SLDTDKMASVFYTLVIPMLNPLIYSLRNND\aWALKKTlvmWCYSFKSM (SEQ ID NO: 445).

ATGGCTCCTGAAAATTTCACCAGGGTCACTGAGTTTATTCTCACAGGTGTCTCTAGCTGTC
CAGAGCTCCAGATTCCCCTCTTCCTGGTCTTCCTAGTGCTCTATGTGCTGACCATGGCAGG 
GAACCTGGGCATCATCACCCTCACCAGTGTTGACrCTCGACrTCAAACCCCCATGTACTTTT 
TCCTGAGA i^ CTAGCTATC ^^ 

ATGAACTTTTTAGTAAAGAAGAAAACTACCTCATTCTATGAATGTGCCACCCAACTGGGAG
GGTTCTTGTTCTTTATTGTATCGGAGGTAATGATGCTGGCTGTGATGGCCTATGACCGCTA
TGTGGCCATTTGTAACCCTCTGCTCTACATGGTGGTGGTGTCTCGGCGGCTCTGCCTCCTGC 
TGGTGTCCCTCACGTACCTCTATGGCTTTTCTACAGCTATTGTGGTTTCACCTTGTATATTC 
TCTGTGTCTTATTGCTCTTCTAATATAATCAATCATTTTTACTGTGATATTGCACCTCTGTT 
AGCATTATCTTGCTCTGATACTTACATACCAGAAACAATAGTCTTTATATCTGCAGCAACA 
AATTTGTTTTTTTCCATGATTACAGTTCTAGTATCTTATTTCAATATTGTTTTGTCCATTCTA
AGGATACGTTCACCAGAAGGAAGGAAAAAAGCCTTTTCCACCTGCGCTTCGCATATGATA 
CTCATTGTGACCGTCACCACCTGTGACAGCAGCCTTCACATGCCCATGTACTTCTTCCTCAG
GAATCTGTCTATCTTGGATGCCTGCrACATTTCTGTTACAGTCCCTACCTCATGTGTCAATT 
COCTACTGGACAGCACCACCATTTCTAAGGCGGGATGTGTAGCTCAGGTCTTCCTCGTGGT 

tWttgtatatgtggagcttctgtttctcaccattatggctcatgaccgctatgtggctg 

TCTGCCAGCCACTrCACTACCCTGTGATCGTGAACTCTCGAATCTGCATCCAGATGACACT
GGCCTCCCTACTCAGTGGTCTTGTCTATGCAGGCATGCACACTGGCAGCACATTCCAGCTG 
CCCTTCTGTCGGTCCAACGTTATTCATCAATTCTTCTGTGACATCCCCTCTCTGCTGAAGCT 
CTCTTGCTCTGACACCTTCAGCAATGAGGTCATGATTGTTGTCTCTGCTCTGGGGGTAGGT 
GGCGGCTGTTTCATCTTTATCATCAGGTCTTACATTCACATCTTTTCGACCGTGCTCGGGTT 
TCCAAGAGGAGCAGACAGAACAAAGGCCTTTTCCACCTGCATCCCTCACATCCTGGTGGTG

TCAGTCTTCCnOAGTTGATG 

CCAGGATCTGATCCTTTCTGGTTTTTATTCCATAATGCCTCCCCTCTTTAACCCTATTATTTA 
CAGTCTTAGAAATAAGCAAATAAAGGTGGCCATCAAGAAAATCATGA^^
TTCAGAAAATGTGTAA (SEQ ID NO: 438). 

AOLFR235 sequences: 

MDGVNDSSLQGFVLMGISDHPQLEMIFFIAIIJSY]^TLLGNSTIILI^RLEAIU.HTPMYFF^NL 
SSLDLAFATSSVPQMLINLWGPGKTISYGGCITQLYVFLWLGATECILLVVMAFDRYVAVCRPL 

NOAVLNGVCTFFTAVPLSIIVISYCLIAQAVLH^

LLPAKNSKQDQGKFlSLFYSLVTPMVNPLIYTLRl^EVKGALPvRLLGKGREVG (SEQ ID NO: 439).

ATGGACGGGGTGAATGATAGCTCCTTGCAGGGCTTTGTTCTGATGGGCATATCAGACCATC 
CCCAGCTGGAGATGATCTTTTTTATAGCCATCCTCITCTCCTATTTGCTGACCCTACTTGGG
AACTCAACCATCATCTTGCTTTCCCGCCTGGAGGCCCGGCTCCATACACCCATGTACTTCTT 
CCTCAGCAACCTCTCCTCCTTGGACCTTGCTTTCGCTACTAGTTCAGTCCCCCAAATGCTGA 
TCAATTTATGGGGACCAGGCAAGACCATCAGCTATGGTGGCTGCATAACCCAGCTCTATGT 
CTTCCTTTGGCTGGGGGCCACCGAGTGCATCCTGCTGGTGGTGATGGCATTTGACCGCTAC 
GTGGCAGTGTGCCGGCCCCTCCGCTACACCGCCATCATGAACGCCCAGCTCTGCTGGCTGC
TGGCTGTGATTGCCTGCCTGGGTGGCTTGGGCAACTCTGTGATCCAGTCAACATTCACTCT 
GCAGCTCCCATTGTGTGGGCACCGGAGGGTGGAGGGATTCCTCTGCGAGGTGCCTGCCAT 
GATCAAACTGGCCTGTGGCGACACAAGTCTCAACCAGGCTGTGCTCAATGGTGTCTGCACC 
TTCTTCACTGCAGTCCCACTAAGCATCATCGTGATCTCCTACTGCCTCATTGCTCAGGCAGT 
GCTGAAAATCCGCTCTGCAGAGGGGAGGCGAAAGGCGTTCAATACGTGCCTCTCCCATCT
GCTGGTGGTGTTCCTCTTCTATGGCTCAGCCAGCTATGGGTATCTGCTTCCGGCCAAGAAC 
AGCAAACAGGACCAGGGCAAGTTCATTTCCCTGTTCTACTCGTTGGTCACACCCATGGT 
ATCCCCTCATCTACACGCTGCGGAACATGGAAGTGAAGGGCGCACTGAGGAGGTTGCTGG 

GGAAAGGAAGAGAAGTTGGCTGA (SEQ ID NO: 440). 


AOLFR236 sequences: 

MTSQERDTAIYSINVSFVAKGMTSRSVCEKMTMTTENPNQTVVSHFFI^Gl^YT 

LFIJLrySITVAGNLLILLTVGSDSHLSLPMYHFLGHLSFLDACI^TVTVPKVMAGLLTI^ 

FEGCAVOLYCFHFLASTECFLYTVMAYDRYLAICQPLHYPVAMNRRMCAEMAGITWAIGATH 

At AAIHTSLTTRLLYCGPCHIAYFFCDIPPVLKIACrDTTINELVMI^SIGIV 
TLRNKEVKHALQRLLCSSFRESTAGSPPP (SEQ ID NO: 441). 

ATGACATCTCAGGAAAGGGATACAGCrrATrTATTCCATTAATGTCAGTTTTGTTGCAAAGG 
GGATGACTAGCCGCTCTGTGTGTGAGAAGATGACCATGACAACGGAGAACCCCAACCAGA
CTGTGGTGAGCCACTTCTTCCTGGAGGGTTTGAGGTACACCGCT 

CTTCCTCCTCTrCCTCCTCATCTACAGCATCACTGTGGCTGGGAATCTCCTCATCCTCCTAA 

c^5Sgctctgactctcacctcagcttacccatgtaccacttcctggggc 

CTGGATGCCTGTTrGTCTACAGTGACAGTGCCCAAGGTCATGGCAGGCCTGCrGACTC 
ATGGGAAGGTGATCTCCTTTGAGGGCTGTGCCGTACAGCTTTATTGCTrCCA

CAGCACTGAGTGCTTCCTGTACACAGTCATGGCCTATGACCGCTATCTGGCTATCTGTCAA 
TfflGMNSAEGRKKAFATCSSHLTVVlLJ^GAAryTY
LIYSLRNKDVMGALKKMLTVEPAFQKAME (SEQ ID NO: 433). 

ATGGACAACATCACCTGGATGGCCAGCCACACTGGATGGTCGGATTTCATCCTGATGGGAC 

» tcttcagacaatccaaacatccaatggccaatatcacctggatggccaaccacactggaYg 

GTCGGATTTCATCCTGTTGGGACTCTTCAGACAATCCAAACATCCAGCACTACTrTGTraTG 
GTCATTTTTGTGGTTTTCCTGATGGCGTTGTCTGGAAATGCTGTCCTGATCCTTCTGATACA 
CTGTGACGCCCACCTCCACACCCCCATGTACTTTTTCATCAGTCAATTGTCTCTCATGGACA 
TGGCGTACATTTCTGTCACTGTGCCCAAGATGCTCCTGGACCAGGTCATGGGTGTGAATAA 




CCCTGTCCTCATGAACCATAGGGTGTGTCTCTTCCTGTCATCAGGCTGCTGGTTCCTGGGCT 

CAGTGGATGGCTTCACATTCACTCCCATCACCATGACCTTCCCCTTCCGTGGATCCCGGGA 

GATTCATCATTTCTTCTGTGAAGTTCCTGCTGTATTGAATCTCTCCTGCTCAGACACCTCAC 

TCTATGAGATTTTCATGTACTTGTGCTGTGTCCTCATGCTCCTCATCCCTGTGGTGATCATT 

TCAAGCTCCTATTTACTCATCCTCCTCACCATCCACGGGATGAACTCAGCAGAGGGCCGGA 

AAAAGGCCTTTGCCACCTGCTCCTCCCACCTGACTGTGGTCATCCTCTTCTATGGGGCTGCC 

ATCTACACCTACATGCTCCCCAGCTCCTACCACACCCCTGAGAAGGACATGATGGTATCTG 

TCTTCTATACCATCCTCACTCCAGTGGTGAAGCCTTTAATCTATAGTCITAGGAATAAGG^ 

AOLFR233 sequences: 

MANITRMANHTGKLDFILMGLFRRSKHPALI^VVIF\nTLKAI^GNAVLILLmCDAHLHSPMY 

FFISQLSLMDMAYISVTVPKMLLDQVMGVNKVSAPECGMQMFLYLTLAGSEFFLLATMAYDR 

YVAJCHPLRYPVLMNHRVCLFLASGCWFLGSVDGFMLTPITMSFPFCRSWEIHHFFCEVPAVTI 

LSCSDTSLYETLMYLCCVLMLLIPVTIISSSYLLILLTVHRMNSAEGPJCKAFATCSSHLTVV1LFY 

GAAVYTYMLPSSYHTPEKDMMVSVFYTILTPVLNPLIYSLPJ^VMGALKK^ 
(SEQ ID NO: 435). 

ATGGCCAACATCACCAGGATGGCCAACCACACTGGAAAGTTGGATTTCATCCTCATGGGAC 

TCTTCAGACGATCCAAACATCCAGCTCTACITAGTGTGGTCATCTTTGTGGTrTTCCTGAAfT 

GCGTTGTCTGGAAATGCrGTCCTGATCCTTCrGATACACTGTGACGCCCACCTCCACAGCC 

CCATGTACTTTTTCATCAGTCAATTGTCTCTCATGGACATGGCGTACATTTCTGTCACTGTG

CCCAAGATGCTCCTGGACCAGGTCATGGGTGTGAATAAGGTCTCAGCCCCTGAGTGTGGG 

ATGCAGATGTTCCTCTATCTGACACTAGCAGGTTCGGAATTTTTCCTTCTAGCCACCATGGC 

CTATGACCGCTACGTGGCCATCTGCCATCCTCTCCGTTACCCTGTCCTCATGAACCATAGG 

GTCrGTCrTTTCCTGGCATCGGGCTGCTGGTTCCTGGGCTCAGTGGATGGCTTCATGCTCAC 

TCCCATCACCATGAGCTTCCCCTTCTGCAGATCCTGGGAGATTCATCATTTCTTCTGTGAAG 

IS™™^ CGATCCTGTCCTGCTCAGACACCTCACT 

TGCrGTGTCCTCATGCrCCrCATCCCrGTGACGATCATTTCAAGCTCCTATTTACTCATCCT 

CCTCACCGTCCACAGGATGAACTCAGCAGAGGGCCGGAAAAAGGCCTTTGCCACCTGCTC 

CTCCCACCTGACTGTGGTCATCCTCTTCTATGGGGCTGCCGTCTACACCTACATGCTCCCCA 

GCTCCTACCACACCCCTGAGAAGGACATGATGGTATCTGTCTTCTATACCATCCTCACTCC 

GGTGCTGAACCCrrTAATCTATAGTCTTAGGAATAAGGATGTCATGGGGGCTCTGAAGAAA 

ATGTTAACTGTGAGATTCGTCCTTTAG (SEQ ID NO: 436). 

AOLFR234 sequences: 

l^NSTTVMEFLLMRFSDVWTLQILHSASFFMLY^ 

lsildacyisvtvftscvnslldstttskack:vaqvflvwfvyvellfltimahdryvavcqpl 

HYPVIVNSRICIQMTLASLLSGLVYAGMHTGSTFQLPFCRSNVIHQFFCDIPSLLKLSCSDTFSNE 
VMIWSALGVGGGCWFIIRSY1HIFSTVLGFPRGADRTKAFSTC1PHILWS 
AffAATQDLILSGFYSIMPPLFNPirYSLRNKQIKVAIKKIMKRIFYSENV (SEQ ID NO: 437). 

ATGCCCAATTCAACCACCGTGATGGAATTTCTCCrCATGAGGTTTTCTGATGTGTGGACAC 
TACAGATTTTACATTCTGCATCCTTCTTTATGTTGTATTTGGTAACTCTAATGGGAAACATC 






AOLFR230 sequences: 

MGMEGLLQNSTNFVLTGLITHPAFPGLLFAIWSIFWAITANLVMILLfflMDSRLHTPMYFLLS 
OLSIMDTIYICITVPKMLQDLLSKDKTISFLGCAVQIFLYLTLIGGEFFLLGLMAYDRYVAVCNP 
LRYPLLMNMIVCLFMVVGSWVGGSLDGFMLTPVTMSFPFCRSKEINHFFCEIPAVLKLSCTDTS 
LYETmYACCVLMLLIPLSVISVSYTHILLTVHRMNSAEGRRKAFATCSSHIMVVSVFYGAAFY
TNV1JHSYHTPEKDKWSAFYTILTPMLNPLIYSLRNKDVAAALPJCVLGRCGSSQSIRVATVIR 

KG (SEQ ID NO: 429). 

ATGGGCATGGAGGGTCTTCTCCAGAACTCCACTAACTTCGTCCTCACAGGCCTCATCACCC 
ATCCTGCCTTCCCCGGGCTTCTCTTTGCAATAGTCTTCTCCATCTTTGTGGTGGCTATAACA
GCCAACTTGGTCATGATTCTGCTCATCCACATGGACTCCCGCCTCCACACACCCATGTACTT 
CTTGCTCAGCCAGCTCTCCATCATGGATACCATCTACATCTGTATCACTGTCCCCAAGATGC 
TCCAGGACCTCCTGTCCAAGGACAAGACCATTTCCTTCCTGGGCTGTGCAGTTCAGATCTT 
CCTCTACCTGACCCTGATTGGAGGGGAATTCTTCCTGCTGGGTCTCATGGCCTATGACCGC 
TATGTGGCTGTGTGCAACCCTCTACGGTACCCTCTCCTCATGAACCGCAGGGTTTGCTTATT
CATGGTGGTCGGCTCCTGGGTTGGTGGTTCCTTGGATGGGTTCATGCTGACTCCTGTCACT 
ATGAGTTTCCCCTTCTGTAGATCCCGAGAGATCAATCACTTTTTCTGTGAGATCCCAGCCGT 
GCTGAAGTTGTCTTGCACAGACACGTCACTCTATGAGACCCTGATGTATGCCTGCTGCGTG 
CTGATGCTGCrTATCCCTCTATCTGTCATCTCTGTCTCCTACACGCACATCCTCCTGACTGT 
CCACAGGATGAACTCTGCTGAGGGCCGGCGCAAAGCCTTTGCTACGTGTTCCTCCCACATT
ATGGTGGTGAGCGTTTTCTACGGGGCAGCCTTCTACACCAACGTGCTGCCCCACTCCTACC 
ACACTCCAGAGAAAGATAAAGTGGTGTCTGCCTTCTACACCATCCTCACCCCCATGCTCAA 
CCCACTCATCTACAGCTTGAGGAATAAAGATGTGGCTGCAGCTCTGAGGAAAGTACTAGG 
GAGATGTGGTTCCTCCCAGAGCATCAGGGTGGCGACTGTGATCAGGAAGGGCTAG (SEQ ID NO: 430).

AOLFR231 sequences: 

MERANHSVVSEFILLGLSKSQNLQILFFLGFSVVFVGIVLGNLLILVT\nTDSLLHTPMYFLLSNL 
SCn^MILASFATPKMTVDFLREPJKTISWWGCYSQMFFMHLLGGSEMMLLVAMAIDRYVAICKP 
LHYMTIMSPRVLTGLLI^SYAVGFVHSSSQMAFMLTLPFCGPNVIDSFFCDLPLVIKLACKDTYI

SlLviadsgI^lvcfllllvsyg^ 

PFSRYSVDKILSVFYTIFTPLLNPIIYTLRNQEVKAAIKKRLCI (SEQ ID NO: 431). 

atggaaagagcaaaccattcagtggtatcggaatttattttgttgggactttccaaatctc 

AAAATCirCAGATTTrATrCTTCTTGGGATTCTCTGTGGTCTTCGTGGGGATTGTGTTAGGA
AAOTGCTCATCTTGGTGACTGTGACCITTGATTCGCTCCTTCACACACCAATGTATTTTCT 
GCTTAGCAACCTCTCCTGCATTGATATGATCCTGGCTTCITTTGCTACCCCTAAGATGATTG 
TAGATTTCCTCCGAGAACGTAAGACCATCTCATGGTGGGGATGTTATTCCCAGATGTTCTT 
TATGCACCTCCTGGGTGGGAGTGAGATGATGTTGCTTGTAGCCATGGCAATAGACAGGTAT 
GTTGCCATATGCAAACCCCTCCATTACATGACCATCATGAGCCCACGGGTGCTCACTGGGC
TACTGTTATCCTCCTATGCAGTTGGATTTGTGCACTCATCTAGTCAAATGGCTTTCATGTTG 
ACITTGCCCTTCTGTGGTCCCAATGTTATAGACAGCTTTTTCTGTGACCTTCCCCTTGTGAT 
TAAACTTGCCTGCAAGGACACCTACATCCTACAGCTCCrGGTCATTGCTGACAGTGGGCTC 
CTGTCACTGGTCTGCITCCTCCTCrTGCTTGTCTCCTATGGAGTCATAATATTCTCAGTTAG 
GTACCGTGCTGCTAGTCGATCCTCTAAGGCTTTCTCCACTCTCTCAGCTCACATCACAGTTG 
TGACTCTGTTCTTTGCTCCGTGTGTCTTTATCTACGTCTGGCCCTTCAGCAGATACTCGGTA 
GATAAAATTCTTTCTGTGTTTTACACAATTTTCACACCTCTCTTAAATCCT 
ATTAAGAAATCAAGAGGTAAAAGCAGCCATTAAAAAAAGACTCTGCATATAA (SEQ ID NO: 432).

AOLFR232 sequences: 

MDNITWMASHTGWSDFILMGLFRQSKlIPMANITWMANHTGWSDFn.LGLFRQSKHPALLCV 
WVVF^SGNAVLILLmCDAHLHTPMYFFISQLSLMD^ 

APECGMQMFFYWLAGSEFFLLATMAYDRYVA1CHPLRYPVLMNHRVCLFLSSGCWFLGSVD 
G^TPimTFPFRGSPJEIHHFFCEVPAVLNLSCSDTSLYEIFMYLCCVLMLLIPVVnSSSYLLILL 
AAAGGCACTCTCAGCTTTTGACACCAACAAGTTGGTCTCTGTACTGTATGCTGTCATTGTA
CCATTGCTCAATCCCATCATTTACTGCCTGCGCAATCAAGAGGTCAAGAGAGCCCTATGCT 

O (SEQ ID NO: 424).
AOLFR227 sequences: 

MEPQNTSTVTM^QLLGFQNIXEWQALLFVIFLLIYCLTnGNVVriTVVSQ 
I^FLEVWYTSTTVPLLLANLI^WGQAISFSACMAQL^^ 

LRYPFLNfflRGLCARLVWSWCTGVSTGFLHSMMISW.DFCGRNQINHFFCDLPPLMOLSCSRV 
YITEVTIFIl^IAVLCICFFLTLGPYWIVSSILRIPSTSGRRKTFSTCGSHLAWTLYYG^S^ 
CPSPHLLPEIh^ISWYTVVTPLLNPVIYSLPJ^FKEAVPJCVMPJOCCGIL 
LY (SEQ ID NO: 425). 

ATGGAGCCCCAAAATACCTCCACTGTGACTAACTTTCAGCTGTTAGGATTCCAGAACCTTC 

TTGAATGGCAGGCCCTGCTCTTTGTCATTTTCCTGCTCATCTACTGCCTGACCATTATAGGG 

AATGTTGTCATCATCACCGTGGTGAGCCAGGGCCTGCGACTGCACTCCCCTATGTACATGT 

TCCTCCAGCATCTCTCCTTTCTGGAGGTCTGGTACACGTCCACCACTGTGCCCCTTCTCCTA 

GCCAACCTGCTGTCCTGGGGCCAAGCCATCTCCTTCTCTGCCTGCATGGCACAGCTCTACT 

TC^CGTATTCCTCGGCGCCACCGAGTGCTTTCTCCTGGCCTTCATGGCCTATGACCGTTAC 

CTGGCCATCTGCAGCCCACTCCGCTACCCCTTTCTCATGCATCGTGGGCTATGTGCCAGGTT 

GGTGGTGGTCTCATGGTGCACAGGGGTCAGCACAGGCTTTCTGCATTCCATGATGATTTCC 

AGGTTGGACTTCTGTGGGCGCAATCAGATTAACCATTTCTTCTGCGACCTCCCGCCACTCA 

TGCAGCTCTCCTGTTCCAGAGTTTATATCACCGAGGTGACCATCTTCATCCTGTCAATTGCC 

GTGCTGTGCATTTGTTTTJ "J TCTGACACTGGGGCCCTATGTTTTCATTGTGTCCTCCATATT 

GAGAATCCCTTCCACCTCTGGCCGGAGAAAGACCTTTTCCACATGTGGCTCCCACCTGGCT 

GTTGTCACTCTCTACTACGGGACCATGATCTCCATGTATGTGTGTCCCAGTCCCCACCTGTT 

GCCTGAAATCAACAAGATCATTTCTGTCTTCTACACTGTGGTCACACCACTGCTGAACCCA 

G TT A I CTACAGC ™ AGGAACAAAGAOTC ^ 

AAATGTGGTATTCTATGGAGTACAAGTAAAAGGAAGTTCCTTTATTAG (SEQ ID NO: 426).




ATGTTTTATGTAAATCAGATACCTTTCCAACTTTATCATATCTCTTTCGTGTACCCTACAGA 

GCTATGGAGCAGAGCAATTATTCCGTGTATGCCGACTTTATCCTTCTGGGTTTGTTCAGCA 

ACGCCCGTTTCCCCTGGCTrCTTTGCCCTCATTCTCCTGGTCrTTGTGACCTCCATAGCCAG 

CAACGTGGTCAAGATCATTCTCATCCACATAGACTCCCGCCTCCACACCCCCATGTACTTC 

CTGCTCAGCCAGCTCTCCCTCAGGGACATCCTGTATATTTCCACCATTGTGCCCAAAATGCT 

GGTCGACCAGGTGATGAGCCAGAGAGCCATTTCCTTTGCTGGATGCACTGCCCAACACTTC 

CTCTACTTGACCTrAGCAGGGGCTGAGTTCTTCCTCCTAGGACTCATGTCCTGTGATCGCTA 

CGTAGCCATCTGCAACCCTCTGCACTATCCTGACCTCATGAGCCGCAAGATCTGCTGGTTG 

ATTGTGGCGGCAGCCTGGCTGGGAGGGTCTATCGATGGTTTCTTGCTCACCCCCGTCACCA 

TGCAGTTCCCCTTCTGTGCCTCTCGGGAGATCAACCACTTCTTCTGCGAGGTGCCTGCCCrT 

CTGAAGCTCTCCTGCACGGACACATCAGCCTACGAGACAGCCATGTATGTCTGCTGTATTA 

TGATGCTCCTCATCCCTTTCTCTGTGATCTCGGGCTCTTACACAAGAATTCTCATTACTGTT 

TATAGGATGAGCGAGGCAGAGGGGAGGCGAAAGGCTGTGGCCACCTGCTCCTCACACATG 

GTGGTTGTCAGCCTCTTCTATGGGGCTGCCATGTACACATACGTGCTGCCTCATTCTTACCA 

CACCCCTGAGCAGGACAAAGCTGTATCTGCCTTCTACACCATCCTCACTCCCATGCTCAAT 

CCACTCATTTACAGCCTTAGGAACAAGGATGTCACGGGGGCCCTACAGAAGGTTGTTGGG 

AGGTGTGTGTCGTCAGGAAAGGTAACCACTTTCTAA (SEQ ID NO: 428).
GCTGCrGTCTGTCGTCCACTCCACTACATGGCCATCATGCACCCCCATCTCTGCCAGACCCT
GGCTATCGCCTCCTGGGGTGCGGGTTTCGTGAACTCTCTGATCCAGACAGGTCTCGCAATG 
GCCATGCCTCTCTGTGGCCATCGACTGAATCACTTCTTCTGTGAGATGCCTGTATTTCTGAA 
GTTGGCTTGTGCGGACACAGAAGGAACAGAGGCCAAGATGTTTGTGGCCCGAGTCATAGT 
CGTGGCTGTTCCTGCAGCACTTATTCTAGGCTCCTATGTGCACATTGCTCATGCAGTGCTG
AGGGTGAAGTCAACGGCTGGGCGCAGAAAGGCTTTTGGGACTTGTGGGTCCCACCTCCTA 
GTAGTTTTCCTTTTTTATGGCTCAGCCATCTACACATATCTCCAATCCATCCACAATTATTC 
TGAGCGTGAGGGAAAATTTGTTGCCCTTTTTTATACTATAATTACCCCCATTCTCAATCCTC 
TCATTTATACACTAAGAAACAAGGACGTGAAGGGGGCTCTGTGGAAAGTACTATGGAGGG 

GCAGGGACTCAGGGTAG (SEQ ID NO: 420).
AOLFR225 sequences: 

MENYNOTSTDFILLGLFPPSIIDLFFFILIWIFLMALIGNLSMILLIFLDTHLHTPMYFLLSQLSLID 
LNY1STIVPKMASDFLHGNKSISFTGCGIQSFFFLALGGAEALLLASMAYDRYIAICFPLHYLIRM 
SKRVCVLMITGSWIIGSINACAHTVYVLHIPYCRSRAINHFFCDVPAMVTLACMDTWVYEGTV
FLSATIFLWPFIGISCSYGQVLFAVTHMKSAEGRKKAYLTCSTHLTVVTFYYAPFVYTYLRPRS 
LRSPTEDKVLAVFYTILTPMLNPIIYSLRNKEVMGALTRVSQR1CSVKM (SEQ ID NO: 421). 

ATGGAAAATTACAATCAAACATCAACTGATTTCATCITATTGGGGCTGTTTCCACCATCAA 
TAATTGACCTTTTCTTCTTCATTCTCATTGTTTTCATTTTCCTGATGGCT

TGTCCATGATTCTTCTCATCTTCTTGGACACCCATCTCCACACACCCATGTATTTCCTACTG 
AGTCAGCTCTCCCTCATTGACCTAAATTACATCTCCACCATTGTTCCTAAGATGGCATCTGA 
mCTGCATGGAAACAAGTCTATCTCCTrCACTGGGTGTGGGATTCAGAGTTTCTTCTTCT 
TGGCATTAGGAGGTGCAGAAGCACTACTTTTGGCATCTATGGCCTATGATCGTTACATTGC 
TATTTGCTTTCCTCTCCACTATCTCATCCGCATGAGCAAAAGAGTGTGTGTGCTGATGATA
ACAGGGTCTTGGATCATAGGCTCGATCAATGCTTGTGCTCACACTGTATATGTACTCCATA 
TTCCTTATTGCCGATCCAGGGCCATCAATCATTTCTTCTGTGATGTCCCAGCAATGGTGACT 
CTGGCCTGCATGGACACCTGGGTCTATGAGGGCACAGTGTTTTTGAGTGCCACCATCTTTC 
TCGTGTTTCCCTTCATTGGTATTTCATGTTCCTATGGCCAGGTTCTCTTTGCTGTCTACCAC 
ATGAAATCTGCAGAAGGGAGGAAGAAAGCCTATTTGACCTGCAGCACCCACCTCACTGTA
GTAA(^CTACTATGCACCITITGTCTACACTTATCTACGTCCAAGATCCCT^ 
AACAGAGGACAAGGTTCTGGCTGTCTTCTACACCATCCTCACCCCAATGCTCAACCCCATC 
ATCTATAGCCTGAGGAACAAGGAGGTGATGGGGGCCCTGACACGAGTGAGTCAGAGAATC 

TGCTCTGTGAAAATGTAG (SEQ ID NO: 422). 


AOLFR226 sequences: 

MEWRNHSGRVSEF\^LGFPAPAPLQVLLFALLLLAY\n.VLTENTUIMAIRNHSTLHKTMYFF^ 
ANMSFLEIWYVTNO-IPKMLAGFVGSKQDHGQLISFEGCMTQLYFFLGLGCrreCVLIJVVMAYD 
RYMAICYPLHYPVIVSGRLCVQMAAGSWAGGFGISMVKWLISGI^YCGPMINHFFCDVSPLL 
NI^CTDMSTAELTDFU.AinLLGPLSVTGASYVMTGAVMHISSAAGRYKAFSTCASHLTVVIIF

"YAASIFIYARPKAI^AFim«L 
IcKASRNV (SEQ ID NO: 423). 

ATGGAGTGGCGGAACCATAGTGGGAGAGTGAGTGAGTTrGTGTTGCTGGGCTTCCCTGCT 
CCTGCGCCACTACAGGTACTATTGTTTGCCCTmGCTGCTGGCCTATGTGTTGGTGCTGAC
TGAGAACACACTCATCATTATGGCAATTAGGAACCATTCTACCCTCCACAAACCCATGTAC 
TTTTTTCTAGCTAATATGTCCTTTCTGGAGATCTGGTATGTCACTGTCACTATTCCCAAGAT 
GCTTGCTGGCITTGTTGGATCCAAACAGGATCATGGACAGCTAATCTCCTrTGAGGGATGC 
ATGACACAGCTCTACTTTTTCCTTGGCTrGGGCTGCACTGAGTGTGTCCITCTCGCTGTTAT 
50 GGCCTATGATCGCTATATGGCCATCTGCTATCCTCTCCACTACCCAGTCATTGTCAGTGGCC 
GGCTGTGTGTGCAGATGGCTGCTGGCTCTTGGGCTGGAGGTTTTGGCATCTCCATGGTCAA 

AGTTTTTCITATTTCTGGCCTCTCTrACTGTGGCC 

ATGTCTCTCCATTGCTCAACCTCTCATGCACTGATATGTCCACAGCAGAGCrTACAGATTTC 

ATCCTGGCCATTITTATTCTTCTAGGGCCACTCTCTGTCACTGG^ 
TACTGGTGCTGTGATGCACATATCrTCGGCTGCTGGACGCTATAAGGCCTTTTCCACCTGT
G^CTCTCATCTCA^CTGTTGTGATAATCTTCTATGCAGCCAGTATC 
ATGGGGCAGACCAACGTAACCTCCTGGAGGGATTTTGTCTTCCTGGGCTTCTCCAGTTCTG

GGGAGTTGCAGCTCCTTCTCTTTGCCTTGTTCCTCTCTCTGTATCTAGTCACTCTGACCAGC 

AATGTCTTCATTATCATAGCCATCAGGCTGGATAGCCATCTGCACACCCCCATGTACCTCTT 

CCTTTCCTTCCTATCCTTCTCTGAGACCTGCTACACTTTGGGCATCATCCCTAGAATGCTCT 

CTGGCCTGGCTGGGGGGGACCAGGCTATCTCCTATGTGGGCTGTGCTGCCCAGATGTTCTT

TTCTGCCTCATGGGCCTGTACTAACTGCTTCCTTCTGGCTGCCATGGGCTTTGACAGATATG 

TGGCCATCTGTGCTCCACTCCACTATGCCAGCCACATGAATCCTACCCTCTGTGCCCAGCT 

GGTCATTACTTCCTTCCTGACTGGATACCTCTTTGGACTGGGAATGACACTAGTTATTTTCC 

ACCTCTCATTCTGCAGCTCCCATGAAATCCAGCACTTTTTTTGTGACACGCCACCTGTGCTG 

AGCCTAGCCTGTGGAGATACAGGCCCGAGTGAGCTGAGGATCTTTATCCTCAGTCTTTTGG

TCCTCTTGGTCTCCTTCTTCTTCATCACCATCTCCTACGCCTACATCTTGGCAGCAATACTG 

AGGATCCCCTCTGCTGAGGGGCAGAAGAAGGCCTTCTCCACTTGTGCCTCGCACCTTACAG 

TGGTCATTATTCATTATGGCTGTGCTTCCTTCGTGTACCTGAGGCCCAAAGCCAGCTACTCT 

CTTGAGAGAGATCAGCTTATTGCCATGACCTATACTGTAGTGACCCCCCTCCTTAATCCCA 

TTGTTTATAGTCTAAGGACTAGGGCTATACAGACAGCTCTGAGGAATGCTTTCAGAGGGAG
ATTGCTGGGTAAAGGATGA (SEQ ID NO: 416).



AOLFR223 sequences: 

MEAANESSEGISFVLLGLTTSPGQQRPLFVLFLLLYVASLLGNGLIVAAIQASPALHAPMYFLLA 

HLSFADLCFASVTVPKMLANLLAHDHSISIAGCLTQMYFFFALGVTDSCLLAAMAYDCYVAIR

HPLPYATPJvlSRAMCAALVG^lAWLVSHVHSLLYILIJVL\RLSFCASHQWHFFCDHQPLI^LSC 

SDTHHIQLLIFTEGAAVVVTPFLLILASYGAIAAAVLQLPSASGRLRAVSTCGSPILAVVSLFYGT 

VIAVYFQATSPJIEAEWGRVATVMYTWTPMLNPIIYSLWNRDVQGALRALLIGRPJSASDS 
(SEQ ID NO: 417). 


ATGGAGGCTGCCAATGAGTCITCAGAGGGAATCTCATTCGTTTTATTGGGACTGACAACAA 
GTCCTGGACAGCAGCGGCCTCTCnTTGTGCTGTTCTTGCTCTTGTATGTGGCCAGCCTCCTG 
GGTAATGGACTCATTGTGGCTGCCATCCAGGCCAGTCCAGCCCTTCATGCACCCATGTACT 
TCCTGCTGGCCCACCTGTCCTTTGCTGACCTCTGTTTCGCCTCCGTCACTGTGCCCAAGATG 

TTGGCCAACTTGTTGGCCCATGACCACTCCATCTCGCTGGCTGGCTGCCTGACCCAAATGT
ACTTCrTCTTTGCCCTGGGGGTAACTGATAGCTGTCTTCTGGCGQCCATGGCCTATGACTG 
CTACGTGGCCATCCGGCACCCCCTCCCCTATGCCACGAGGATGTCCCGGGCCATGTGCGCA 
GCCCTGGTGGGAATGGCATGGCTGGTGTCCCACGTCCACTCCCTCCTGTATATCCTGCTCA 
TGGCTCGCTTGTCCTrCTGTGCTTCCCACCAAGTGCCCCACTTCTTCTGTGACCACCAGCCT 

CTCTTAAGGCTCTCGTGCTCTGACACCCACCACATCCAGCTGCTCATCTTCACCGAGGGCG
CCGCAGTGGTGGTCACTCCCTTCCTGCTCATCCTCGCCTCCTATGGGGCCATCGCAGCTGC 
CGTGCTCCAGCTGCCCTCAGCCTCTGGGAGGCTCCGGGCTGTGTCCACCTGTGGCTCCCAC 
CTGGCTGTGGTGAGCCTCTTCTATGGGACAGTCATTGCAGTCTACTTCCAGGCCACATCCC 
GACGCGAGGCAGAGTGGGGCCGTGTGGCCACTGTCATGTACACTGTAGTCACCCCCATGC 

TGAACCCCATCATCTACAGCCTCTGGAATCGCGATGTACAGGGGGCACTCCGAGCCCTTCT
CATTGGGCGAAGGATCTCAGCTAGTGACTCCTGA (SEQ ID NO: 418) 



AOLFR224 sequences: 

MGSFNTSFEDGHLVGFSDWPQLEPILFWIFIFYSLTLFGNTinAI^WLDLRLHTPMWFLSHLSL 
LDLCFTTSTWQLLINLCGVDRTITOGGCVAQLFIYI^GSTECVLLVA^MAro
MAIMHPHLCQTLAJL\SWGAGFWSLIQTGLAMAMPLCGHRLNHFFCEM^ 
EAKMFVARVIWAVPAALILGSYVHLAHAV^^ 

YLQSIHNYSEREGKFVAIJ^YTIITPILNPLIYTLRNKDWGALWKVLW (SEQ ID NO: 419).



ATGGGAAGTTTCAACACCAGTTTTGAAGATGGCTTCATTTrGGTGGGATTCTCAGATTGGC 
CGCAACTGGAGCCCATCCTGTTTGTCTTTATTTTTATITTCTACTCCCTAACTCT 
AACACCATCATCATCGCTCTCTCCTGGCTAGACCTTCGGCTGCACACACCTATGTACTTCTT 
TCTCTCTCATCTGTCCCTCCTGGACCTCTGCTTCACCACCAGCACCGTGCCCCAGCTCCTGA 
TCAACCTTTGCGGGGTGGACCGCACCATCACCCGTGGAGGGTGTGTGGCTCAGCTCTTCAT
CTACCTAGCCCTGGGCTCCACAGAGTGTGTGCTCCTGGTGGTGATGGCCTTTGACCGCTAT 
NQIHQIFCDFTPVLSLACTDTFLVVIVDAfflAAEIVASFLVIALSYIRjnVILGMHSAEGHHKAFST
CAAHLAVFLLFFGSVAVMYLRFSATYSWWDTAIAVTFVILAPFFWIIYSLKNKDMKEAIGRLF 

HYQKRAGWAGK (SEQ ID NO: 411). 

ATGAAGCAATATTCAGTGGGTAATCAACATTCCAATrATAGGAGTCTCTTGTTTCCTTTTCT
GTGTTCACAGATGACACAGTTGACGGCCAGTGGGAATCAGACAATGGTGACTGAGTTCCT 
CTTCTCTATGTrCCCGCATGCGCACAGAGGTGGCCTCTTATTCTTTATTCCCnTGCTTCTCA 
TCTACGGATTTATCCTAACTGGAAACCTAATAATGTTCATTGTCATCCAGGTGGGCATGGC 
CCTGCACACCCCTTTGTATTTCTTTATCAGTGTCCTCTCCTTCCTGGAGATCTGCTATACCA 

CAACCACCATCCCCAAGATGCTGTCCTGCCTAATCAGTGAGCAGAAGAGCATTTCCGTGGC
TGGCTGCCTCCTGCAGATGTACTTTTTCCACTCACTTGGTATCACAGAAAGCTGTGTCCTG 
ACAGCAATGGCCATTGACAGGTACATAGCTATCTGCAATCCACTCCGTTACCCAACCATCA 
TGATTCCCAAACTTTGTATCCAGCTGACAGTTGGATCCTGCTTTTGTGGCTTCCTCCtTGTG 
CTTCCTGAGATTGCATGGATTTCCACCTTGCCTTTCTGTGGCTCCAACCAGATCCACCAGAT 

aTTCTGTGATTTCACACCTGTGCTGAGCTTGGCCTGCACAGATACATTCCTAGTGGTCATT
GTGGATGCCATCCATGCAGCGGAAATTGTAGCCTCCTTCCTGGTCATTGCTCTATCCTACA 
TCCGGATTATTATAGTGATTCTGGGAATGCACrCAGCTGAAGGTCATCACAAGGCCTTTTC 
CACCTGTGCTGCTCACCTTGCTGTGTTCTTGCTATTTTTTGGCAGTGTGGCTGTCATGTATT 

TGAGATTCTCAGCCACCTACTCAGTGTrTTGGG^ 
CITGCTCCCTTTTTCAACCCCATCATCTATAGCCTGAAAAACAAGGACATGAAAGAGGCTA
TTGGAAGGCTTTTCCACTATCAGAAGAGGGCTGGTTGGGCTGGGAAATAG (SEQ ID NO: 412).



AOLFR221 sequences: 

MRNLSGGHVEEFVLVGFPTTPPLQLLLFVLFFAIYLLTLLENALIVFTIWLAPSLHRPMYFFLGH
LSFLELWYINVTIPRLLAAFLTQDGRVSYVGCMTQLYFFIA1ACTECVLLAVMAYDRYLAICGP 
LLYPSLMPSSLATPXAAASWGSGFFSSMMKLLFISQLSYCGPNIINHFFCDISPLLNLTCSDKEQA 
ELVDFLLALVMILLPLLAVVSSYTAIIAAILRIPTSRGRHKAFSTCAAHLAVWIYYSSTLFTYAR 
PRAMYTFNHNKIISVLYTIIWFFNPAIYCLRNKEVKEAFRKTVMGRCHYPRDVQD (SEQ ID NO: 413).

ATGAGAAATTTGAGTGGAGGCCATGTCGAGGAGTTTGTCTTGGTGGGTTTCCCTACCACGC 
CTCCCCTCCAGCTGCTCCTCTITGTCCnriTT^ 

ATGCACTTATTGTCITCACAATATGGCTTGCTCCAAGCCTTCATCGTCCCATGTACTTTTTC 
CTTGGCCATCTCTCTrTCCTGGAGCTATGGTACATCAATGTCACCATTCCTCGGCrCTTGGC
AGCCnTTCTTACCCAGGATGGTAGAGTCTCCTACGTAGGTTGCATGACCCAACTGTACTTC 
TTTATTGCCTTAGCCTGTACTGAATGTGTGCTGTTGGCAGTTATGGCCTATGATCGCTACCT 
GGCCATCTGTGGACCCCTCCTTTACCCTAGTCTCATGCCTTCCAGTCTGGCCACTCGCCTTG 
CTGCTGCCTCTTGGGGCAGTGGCTTCTTCAGCTCCATGATGAAGCTTCTTTTTAm 
40 TTGTCCTACTGTGGACCCAACATTATCAACCACTTTTTCTGTGATATTTCCCCACTACTCAA 
CCTCACCTGCrcrGACAAGGAGCAAGCAGAGCTAGTAGACTTCCTTCTGGCCCTGGTGATG 
ATTCTACrCCCTCTATTGGCTGTGGTTTCATCATACACTGCCATCATTGCAGCCATCCTGAG 
GATCCCTACGTCCAGGGGACGCCACAAAGCCTTTTCCACTTGTGCCGCTCATCTGGCAGTG 
GTTGTTATCTACTACTCCTCCACTCTCTTCACCTATGCACGGCCCCGGGCCATGTACACCTT 
45 CAACCACAACAAGATTATCTCTGTGCTCTACACTATCATTGTACCATTCTTCAACCCAGCCA 
TCTACTGCCTGAGGAACAAGGAGGTGAAGGAGGCCTTCAGGAAGACAGTGATGGGCAGAT 
GTCACTATCCTAGGGATGTTCAGGACTGA (SEQ ID NO: 414). 



AOLFR222 sequences: 

50 MGQTNVTSWPJ)FWLGFSSSGELQLLLFALFL£L^ 

SFSETCYTLGIIPPJVfl-SGLAGGDQAISYVGCAAQMFFSASWACTNCTIXAAMGFDRYVAICAPL 
HYASHMNPTLCAQL\^SFLTGYIFGLGMTLVIFHI^FCSSHEIQHFFCDTPPVLSLACGDTGPS 
ELRIFII^LLVLLVSFFnTISYAYIIAAILRIPSAEGQKKAFSTCASHLTVVnHYGCASFVYLRPK 
ASYSLERDQLIAMTYTVVTPLLNPIVYSLRTRAIQTALRNAFRGRLLGKG (SEQ ID NO: 415).

AOLFR218 sequences:

METANYTKVTEFVLTGLSQTREVQLVLFVIF^ 

LLDlWYSSITAPKMLroFFVERKJISFGGCMQLFFLHFVGASEMFLLIVMAyDRYAAJCRPLHYA 
5 TlMNRRLCCrLVAI^WMGGFIHSIIQVAl.I\aU.PFCGPNELDSWCDITQ\^lIACANTFPEELVM 
ICSSGLISWCFLALLMSYAFELALUCKHSGSDENTMIAMSTC 

SFSLDKWSWHTVIFPLLNPIiyTLPvNKEVKAAMRKVVTKYELCEEK (SEQ ID NO: 407) 

ATGGAAACTGCAAATTACACCAAGGTGACAGAATTTGTTCTCACTGGCCTATCCCAGACTC 

GGGAGGTCCAACTAGTCCTATTTGTTATATTTCTATCCTTCTATTTGTTCATCCTACCAGGA

AATATCCTTATCATTTGCACCATCAGGCTAGACCCTCATCTGACITCTCCTATGTATTTCCT 

GTTGGCTAATCTGGCCCTCCTTGATATTTGGTACTCFTCCATTACAGCCCCTAAAATGCTCA 

TAGACTTCrTTGTGGAGAGGAAGATAATTTCCTTTGGTGGATGCATTGCACAGCTCTTCTT 

CTTACACTTTGTTGGGGCTTCGGAGATGTTCTTGCTCATAGTGATGGCCTATGACCGCTAT 

GCTGCTATCTGCCGACCCCTCCACTATGCTACCATCATGAATCGACGTCTCTGCTGTATCCT 

GGTGGCTCTCTCCTGGATGGGGGGCTTCATTCATTCTATAATACAGGTGGCTCTCATTGTT 

CGACTTCCITTCTGTGGGCCCAATGAGTTAGACAGTTACTTCTGTGACATCACACAGGTTG 

TCCGGATTGCCTGTGCCAACACCTTCCCAGAGGAGTTAGTGATGATCTGTAGTAGTGGTCT 

GATCTCTGTGGTGTGTTTCATTGCTCTGTTAATGTCCTATGCCTTCCTTCTGGCCTTGCTCA 

AGAAACATTCAGGCTCAGATGAGAATACCAACAGGGCCATGTCCACCTGCTATTCCCACAT 

TACCATTGTGGTGCTAATGTTTGGGCCATCCATCTACATTTATGCTCGCCCATTTGACTCAT 

TTTCCCTAGATAAAGTGGTGTCTGTGTTTCATACTGTAATATTCCCrTTACTTAATCCCATT 

ATTTACACATTGAGAAACAAGGAAGTAAAGGCAGCCATGAGGAAGGTGGTCACCAAATAT 
ATTTTGTGTGAAGAGAAGTGA (SEQ ID NO: 408). 

AOLFR219 sequences: 

MLTSLTDLCFSPIQVAEIKSLPKSMNETNHSRVTEFVLLGI^SSRELQPFIJFLTFSLLYLAn^ 

LIILTVTSDSRLHTPMYFLLAhTLSFIDVCVASFATPKMIADFLVEPJCTISFDACIAOIFFVHLFTGS 

EM\^LVSMAYDRYVAICKPLHYMTVMSRRVCVVLVLISWFVGFmTTSQLAFTVNLPFCGPN 

KVDSFFCDLPLVTKLACroTYWSLLIVADSGFLSI^SFLLLWSYTVILVTVPJsJRSSASMAKAR 
STLTAMTWTLFTGPCinYVWPFSSY 

RYLKPSQVSWIRNVLFLETK (SEQ ID NO: 409). 
ATGCTCACTTCATTAACTGATCTCTGTTTCTCT

TCCAAAATCGATGAATGAGACAAATCATTCTCGGGTGACAGAATTTGTGTTGCTGGGACTG
TCTAGTTCAAGGGAGCTCCAACCTTTCTTGTTTCTTACATTTTCACTACTTTATCTAGCAAT 
TCTGTTGGGCAACTITCTCATCATCCTCACTGTGACCTCAGATTCCCGCCTTCACACCCCCA 
TGTACTTTCTGCrrTGCAAACCTGTCATTTATAGACGTATGTGTTGCCTCTTTTGCTACCCCT 
AAAATGATTGCAGACTTTCTGGTTGAGCGCAAGACTATTTCTTTTGATGCCTGCCTGGCCC 

40 AGATTTTCTTTGTTCATCTCTTCACTGGCAGTGAAATGGTGCTCCTAGTTTCCATGGCXrrAT 
GACCGTTATGTTGCTATATGCAAACCTCTCCACTACATGACAGTCATGAGCCGTCGTGTAT 
GTGTTGTGCTCGTCCTCATTTCATGGTTTGTGGGCTTCATCCATACTACCAGCCAGTTGGCA 
TTCACTGTTAATCTGCCATTTTGTGGTCCTAATAAGGTAGACAGTTTTTTCTGTGACCTTCC 
TCTAGTGACCAAGTTAGCCTGCATAGACACrTATGTTGTCAGCTTACTAATAGTTGCAGAT 

45 AGTGGci 1 i vl'iTCTCTGAGTTCCTTTCTCCTCTrGGlTGTCTCCTACACTGTAATACTTGTT 
ACAGTTAGGAATCGCTCCTCTGCAAGCATGGCGAAGGCCCGCTCCACATTGACTGCTCACA 
TCACTGTGGTCACTTTATTCTTTGGACCATGCATTTTCATCTATGTGTGGCCCrTCAGCAGT 
TACTCAGTTGACAAAGTCCTTGCTGTATTCTACACCATCTTCACGCTTATTTTAAACCCTGT 
AATCTACACGCTAAGAAACAAAGAAGTGAAGGCAGCTATGTCAAAACTGAAGAGTCGGTA 

AOLFR220 sequences: 

MKQYSVGNQHSNYRSLLFPFECSQMTQLTASGNQTMVTEFLFSMFPHAHRGGLLFFIPLLLIYG 
55 FILTGNLIMFIVIQVGMAIJITPLYFTISVLSFLEICYTTTTIPKMI^CLISEQK^ 

HSLGITESCVLTAMAIDRYIAICJ^LRYPTIMIPKLCIQLTVGSCTCGFELVLPEIAWISTLPFCGS 
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ATATCATTGAGCTGTTTCCTGGCTTTAATTATTTCCTACACCATCATTTTGATCGGTC 
ATGCAGGTCCTCCAGTGGGTCATCTAAGGCTCTTTCTACATTAACTGCCC 

G^ACAAATTTCrrTCT 
CCTGGAAAAACTAG (SEQ ID NO: 402) 

AOLFR216 sequences: 

MDVGNKSTMSEFVLLGLSNSWELQMFFFMWSLLYV^ 
10 SnDMSLASFATPKMITDYLTGHKTlSFDGCLTQIITLHLFTGTEIILLMAMSFDRY 

^svlspov^ 

£S^^ (SEQ ID NO: 403)

i * 4T rr ATnTfiGnCAATAAGTCTACCATGTCTGAATTrGTTTTGCTGGGGCTCTCTAATTCCT 
G^SIcAGATC^ 

TGCTOACCAATCTC 

apaI^aot^caggtcacaaaaccatct 

A^C^TA^A^GCCCCTGCACTAT^ 
CGTGGTGGC^CCTGGATTATGGGAGTTATGCATTCAA^ 

ACGTTACCATTCTGTGGTCCCTATGAGGTAGACAGCTTTTTCTGTGACCTrCCTGTGGT 
CCAGTTGGCrTGTGTGGATACTTATGTTCTGGGCCTCrTT 
25 ATTGCGTTGTGCTGTTITT^ 

rCATCATTCTTCCAGAGGATC 

GAC^A^CTGTCTGTGTrrTATACCATCTTTACTCCCACT^ 

mG^GGAA^ 
TAATAAGGCAATGCCTTCATAG (SEQ ID NO: 404)

AOLFR217 sequences: 
(SEQ ID NO: 405) 

40 ATGCTAGAGTCCTTCCAGAAATCAGAGCAAATGGCCTGGAGC^ 
GAATTCATACTACGGGGTCTGTCCAGTTCITTAGAACTC 
^rATA^GTCTATG^CAGCCACTGTGCT 

CCTGGCCTCATTTGCCACCCCCAAAATGATTGCAGACTrCCTrAGAGAACACAAAGCCArc 
45 TCTTTTGAAGGCTGCATGACCCAGATGTTCn^ 

TA1ACACTCCCCTCTTGAATCCAGTGATCT 
CATGAGGAAACTAAGCAGCCATATCTTTAAATCTAGG

(SEQ ID NO: 406) 
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TTAACTTGCAAAGACCAAAGAAGACGATCACTTACGGTGGTTGTGTGGCGCAACrCTATAT 

TTCTCTGGCACTGGGCTCCACTGAATGTATCCTCTTGGCTGACATGGCCITGGATCGGTAC 

ATTGCTGTCTGCAAACCCCTCCACTATGTAGTCATCATGAACCCACGGCTTTGCCAACAGC 

TGGCATCTATCTCCTGGCTCAGTGGTTTGGCTAGTTCCCTAATCCATGCAACTTTTACCn^G 

CAATTGCCTCTCTGTGGCAACCATAGGCTGGACCATTTTATrrGCGAAGTACCAGCTC^^ 

C^OTGGCTTGTGTGGACACCACTGTCAATGAATTGGTGCT^ 

TTCTOIX^TTOCACCAGCACT^TC^ 

AGGATCAAATCAGTAGAGGCAAGGCATAAAGCCTTCAGCACCTGCTCCTCCCACCTTACArs 

TGGTGATTATATTCTATGGCACCATAATCTACGTGTACCTGCAACCTAGTGACAGCTA^^^ 

CCAGGACCAAGGGAAGTTTATCTCCCTCTTCTACACCATGGTGACCCCCACTTTAAATCCT 

ATCATCTATACTTTAAGGAACAAGGATATGAAAGAGGCTCTGAGGAAACTTCTCTCGGGA 
AAATTGTGA (SEQ ID NO: 398)

AOLFR214 sequences: 

MDKSNSSWSEF\a,LGLCSSQKLQLFYFCFFSVLYTVlVLGNLLULTVTSDTSLJHSPMYFLLGN 
l^™iCQASFATTKMlADFI^AHETISFSGCIAQDTmLFrGGEMVlXVSMAYDRYVAIOO>LY 
YWIMSRRTCTVLVMISWAVSLVHTLSQLSFTVNIJ»FCGPNVVDSFFCDLPRVTKIACLDS 
ILI\^SGILSLSTFSLLVSSYIIILVTVWLKSSAAMAKAFSTLAS 

PLD^^YTVFTP\^NPIIYTLRNRDMKAAVRKIV^ (SEQ ^ 

ATGGATAAGTCCAATTCTTCAGTGGTGTCTGAATTTGTACTGTTGGGACTCTGTAGTTCTC 
AAAAACTCCAGCTTTTCTATTTTTGTTTCTT 

AATCTrCTCATTATCCTCACAGTGACTTCTGATACCAGCCTGCACTCCCCTATGTACTTTCT 
CITGGGAAACCTTTCCTTTGTTGACATTTGTCAGGCTTCTTTTGCTAC 

C A ^'™^ GAGTG CACACGAGACCATATCTrrCAGTGG(?rGCATAGCCCAAATrrrcm 

ATTCACCTTTTTACTGGAGGGGAGATGGTGCTACTTGTTTCGATGGCCTATGACAGGTATG 

TAGCCATATGCAAACCCTTATACTATGTGGTCATCATGAGCCGAAGGACATGCACTGTCTT 

GGTAATGATCTCCTGGGCTGTGAGCrTGGTGCACACATTAAGCCAGTTATCATTTACTGTG 

AACCTGCCTTTTTGTGGACCTAATGTAGTAGACAGCTTTTTTTGTGATCTTCCTCGAGTrc 

CAAACTTGCCTGCCTGGACTCTTACATCATTGAAATACTAATTGTGGTCAATAGTGGAATT 
CT^CCCTAAGCACITTCTCrCTCTTGGTCAGCTCCTACATC^ 

GCTCAAGTCTTCAGCTGCAATGGCAAAGGCATTTTCTACGCTGGCTTCCCATATTGCAGTA 

GTAATATTATTCTTTGGACCTTGCATCTTCATCTATGTGTGGCCCTrrACCATCTCTCCTrT 
^?^^^ OTGCCATA ' mACACTG ' imCACCC CCGTCCTAA^ 

CACTAAGGAATAGGGATATGAAGGCTGCCGTAAGGAAAATTGTGAACCATTACCTGAGGC 
CAA G G AG AATTTCTG AAATGTCA CTAGTA GTGAG AACTTCCTTTCATTAA (SEQ ID NO- 



AOLFR215 sequences: 

MAHTOESMVSEFVLLGLSNSWGLQLFFFAIFSIVYVTSVLGNVLir^SFDSHLNSPMYFIXSNL 
SFmiCQSNFATPKMLVDFFIERKTISFEGCMAQIFVLHSFVGSEMMLLVAMAYDRFIAICKPLH 
YSTIMNRRLCVIFVSISWAVGVLHSVSHIAFIVDLPFCGPNEVDSFFCDLPLVIELACMDTYEM 
EIMTLTNSGUSl^CFLALHSYTnLIGWCRSSSGSSKAI^TLTAHITVVILFFGPCnTY 
PVT>KFI^VF/WCTPLLWIIYSLRNEDVKAAMWKLRNHHVNSWKN (SEQ ID NO: 401) 

ATGGCTCACACAAATGAATCGATGGTGTCTGAGTTTGTACTITTGGGACTCTCTAATTCCT 

GGG ° A j^ CAACI ^ 

AATGTCTTAA7TATTGTCATTATTTCTTTTGACTCCCATTTGAACTCTCCTATGTACTTCTTG 

AGACi 1 1 1 TVf^GAGCGCAAGACTATCTCCTTTGAGGGTTGCATGGCCCAGATATrCGTT 

CTTCACAGTTTTGTTGGGAGTGAGATGATGTTGCTTGTAGCTATGGCATATGACAGATTTA 

TAGCCATATGTAAGCCTCTGCACTACAGTACAATTATGAACCGGAGGCTCTGTGTAATTTT 

TGTGTCTATTTCCTGGGCGGTGGGCGTTCTTCATTCTGTGAGCCACTTGGCITTTACAGTGG 

ACCTGCCATTCTGTGGTCCCAATGAGGTGGATAGCTTCTTTTGTGACCTTCCCTTGGTGATA 

GAGCTGGCTTGCATGGATACATATGAAATGGAAATTATGACCCTAACGAACAGTGGCCTG 
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ATGATGGGTAGAAGGAATAACACAAATGTGGCTGACTTCATCCTTATGGGACTGACACTTT 
CTGAAGAGATCCAGATGGCTCTGTTTATGCTATTTCTCCTGATATACCTAATTACTATGCTG 
GGGAATGTGGGGATGATATTGATAATCCGCCTGGACCTCCAGCTTCACACTCCCATGTATT 
TTTTCCTTACTCACCTGTCATTTATTGACCTCAGTTACTCAACTGTCGTCACACCTAAAACC
TTAGCGAACTTACTGACTTCCAACTATATTTCCITTACGGGCTGCTTTGCCCAGATGTTCTT 
TTTTGCCnTCTTGGGTACTGCTGAATGTTACCTTCTCTCCTCAATGGCCCATGATCGCTATG 
CAGCGATCTGCAGTCCTCTACACTACACAGTTATTATGTCCAAAAGGCTCTGCCTCGCTCT 
CATCACTGGGCCnTATGTGATTGGCTTTATAGACTCCTTTGTCAACGTGGTTTCCATGAGCA 

10 GATTGCATTTCTACGACTCAAACGTAATTCATCACTTTTTCTGTGACACTTCCCCAATTT^ 

GCTCTGTCCTGCACTGATACATACAACACCGAAATCCTGATATTCATTATTGTTGGTTCCAC 
CCTGATGGTGTCCCTTTTCACAATATCTGCATCCTATGTGTTCATTCTCTTTACCATCCTGA 
AAATTAATTCCACTTCAGGAAAGCAGAAAGCTTTCTCTACTTGCGTCTCTCATCTCTTGGG 
AGTCACCATCTTTTATAGCACTCTGATTTTTACTTATTTAAAACCAAGAAAGTCTTATTCCT 

TGGGAAGAGATCAAGTGGCTTCTGTTTTTTATACTATTGTGATTCCCGTGCTGAATCCACT
CATTTATAGTCTTAGAAACAAAGAGGTGAAAAATGCTGTCATCAGAGTCATGCAGAGAAG 

ACAGGACTCCAGGTAA (SEQ ID NO: 394) 
AOLFR212 sequences:

20 MAGNNITEVTWILSGFAhlHPELQVSLFlMFLFIYLFTVLGNLGLITURMDSQlJlTPMYFFL^ 
LAFmiFYSSTVTPKALVNFQSN^SISFVGCFVQMYFFVGLVCCECFLLGSMAYNRYIAICNPL 
LYSVVMSOKVSNWLGVMPYVIGFTSSLISVWVISSLAFCDSSINHFFCDTTALLALSCVDTFGT 
EMVSFVLAGFTLLSSLLIITVTYIIIISAILRIQSAAGRQKAFSTCASHLMAVTIFYGSLIFTYLQPD 
NTSSLTQAQVASWYTI\aPMLNPLIYSLRNKDVKNALLRVlHRKLFP (SEQ ID NO: 395) 


ATGGCTGGCAACAATTTCACTGAGGTTACCGTCTTCATCCT 
CTGAATTACAAGTCAGTCITTTCTTGATGTTTCTCTTC^ 

AACCTGGGACTGATCACGTTAATCAGAATGGATTCTCAGCTTCACACCCCrATGTACTTTT 
TCCTGAGCAATTTAGCATTTATTGACATATTTTACTCCTCTACTGTAACACCTAAGGCATTO 

GTGAATTTCCAATCCAATCGGAGATCCATCTCCTTTGTTGGCTGCTTTGTTCAAATGTACTT
TTTTGTTGGATTGGTGTGTTGTGAGTGTTTCCTTCTGGGATCAATGGCCTACAATCGCTACA 
TAGCAATCTGCAATCCCTTACTGTATTCAGTAGTCATGTCCCAAAAAGTGTCCAACTGGCT 
GGGAGTAATGCCATATGTGATAGGCTTCACAAGCTCGCTGATATCTGTCTGGGTGATAAGC 
AGTTTGGCGTTCTGTGATTCCAGCATCAATCATTTTTTTTGTGACACCACAGCTCTTTTAGC 

ACTCTCCTGTGTAGATACATTCGGCACAGAAATGGTGAGCTTTGTCTTAGCTGGATTCACT
CTTCTTAGCTCTCTCCTTATCATCACAGTCACTTATATCATCATCATCTCAGCCATCCTGAG 
GATCCAGTCAGCAGCAGGCAGGCAGAAGGCCTTCTCCACCTGCGCATCCCACCTCATGGCT 
QTAACTATCTTTTATGGGTCTCTGATTTTCACCTATTTGCAACCTGATAACACATCATCGCr 
GACCCAGGCGCAGGTGGCATCTGTATTCTATACGATTGTCATTCCCATGCTGAATCCACTC 

ATCTACAGTCTGAGGAACAAAGATGTGAAAAATGCTCTTCTGAGAGTCATACATAGAAAA

CTTTTTCCATGA (SEQ ID NO: 396) 
AOLFR213 sequences:

MNSLGKiVSMILSAHVFCYSKFNCFGCTHSIPALGADPPGGMGLGNESSLMDFILLGFSDIB^L 
45 EAVLFWVLFFYLLTLVGNFTimSYLDPPLHTPMWFLSNLSLLDICFTTSLAPQTLVNLQRPKK 
TITYGGCVAQLYISLALGSTECILLADMALDRY1AVCKPLHYVVIMNPRLCQQLASISWLSGLA 
SSLIHATFTLQLPLCGNHRLDHFICEWALLKLACVDTTVNELVLFWSVLFW1PPAU 
TQAVLWKS^ARHKAFSTCSSHLTNmiFYGTIIYVYLQPSDSYAQDQGKFISLFYTMVTPTLNP 

HYTLRNKDMKEALRKLLSGKL (SEQ ID NO: 397) 


ATGAATAGTTTGGGAAAGTTGGTCTCCATGATCCTCTCAGCTCATGTGTTCTGTTATTCTAA 
ATITAATTGTTTTGGATGTACCCATTCCATTCCrGCCTTAGGTGCGGATCCCCCTGGAGGG 
ATGGGATTGGGCAATGAGAGTTCCCTAATGGATTTCATCCTTCT^ 
CTCGTCTGGAGGCTGTTCTCTTTGTATTTGTCCTTTTCTTCTACCTCCT 

55 AACTTCACCATAATCATCATCTCATATCTGGATCCCCCTCT^ 

^CAGCAACCTCTCTlTACTGGACATCTGCTTCACTACTAGCCTTGCTCCrrCAGACCTTAG 
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AOLFR209 sequences: 

MDKINQTFVREFILLG1^GYPKLEIIFFALIL\^YVVILIGNGVLIIASILDSRLHMPMYFFLGNLS 
FLDICYTTSSIPSTLVSLlSKKIWlSFSGCAVQMITGFAMGSTECTLLGMMAFDRyVAICl^mY 
5 PnMNKWYVLLTSVSWI^GGmSTVQTSLAMRWPFCGNNIITvIHFLCEIlAVIJaACSDISVNIV 
TIA.VSNlAFLVlJ'LLVIFFSYMFILYTILRTNSATGRHKAfSTCSAHLTV^ 

QDLLGKD^QATEGLVSMFYGV^PMLNPJIYSLRNKDVKAAIKYLLSRKAJNQ (SEQ ID NO: 
389)

ATGGACAAGATAAACCAGACATTTGTGAGAGAATTCATTCTTCTGGGACTCTCTGGTTACC
CCAAACTTGAGATCATTTTCTTTGCTCTGATTCTAGTTATGTACGTAGTGATTCTAATTGGC 
AATGGTGTTCTGATCATAGCAAGCATCTTGGATTCTCGTCTTCACATGCCCATGTACrTCTT 
CCTGGGCAACCTCTCTTTCCTGGATATCTGCTATACAACCTCCTCCATTCCCTCAACACTGG 
TGAGCTrAATCTCAAAGAAAAGAAACATTTCCTTCTCTGGATGTGCAGTGCAGATGTTCTT 

1 5 TGGGTTTGCAATGGGGTCAACAGAATGTTTCCTCCTTGGCATGATGGCATTTGATCGTTAT 
GTGGCCATCTGTAACCCTCTGAGATACCCCATCATCATGAACAAGGTGGTGTATGTACTGC 
TGACTTCTGTATCATGGCTTTCTGGTGGAATCAATTCAACTGTGCAAACATCACTrGCCAT 
GCGATGGCCITTCTGTGGGAACAATATTATTAATCATTTCITATGCGAGATCTTAGCTGTCC 
TAAAATTAGCTTGTTCTGATATATCTGTCAATATTGTTACCCTAGCAGTGTCAAATATTGCT 

20 TTCCTAGTTCTTCCTCTGCTCGTGATTTTTTTCTCCTATATGTTCATCCTCTACACCATCTTG 
CGAACGAACrCGGCCACAGGAAGACACAAGGCATTTTCTACATGCTCAGCTCACCTGACTG 
TGGTGATCATATTTTATGGTACCATCTTCTTTATGTATGCAAAACCTAAGTCCCAGGACCTC 
CTTGGGAAAGACAACITGCAAGCTACAGAGGGGCTTGTTTCCATGTTTTATGGGGTTGTGA 
CCCCCATGTTAAACCCCATAATCTATAGCTTGAGAAATAAAGATGTAAAAGCTGCTATAAA 
ATATTTGCTGAGCAGGAAAGCTATTAACCAGTAA (SEQ ID NO: 390)

AOLFR210 sequences: 

MMGRRM5TOVADFILTGLSDSEEVQMALFMLFLLIYLITMLGNVGMLLIIRLDLOLHTPMYFFL 
THLSFIDLSYSTWTPKTLANLLTSNYISIT'GCFAQMFCFWLGTAECYLLSSMAYDRYAAICSP 
30 LHYTVIMPKRLCLALITGPYVIGFMDSFVN\^SMSRLHFCDSNIlHHFFCDTSPILAI^CrDTDN 
TEMLIFIL^GSTLMVSLmSASYVSILSTILKINSTSGKQKAFSTCVSHLLGVTIFyGTMIFTYLKP 
RKSYSLGRDQVAPVFYTIVIPMLNPLIYSLRNREVKNALIRVMQRRQDSR (SEQ ID NO: 391) 

ATGATGGGTAGAAGGAATGACACAAATGTGGCTGACTTCATCCTTACGGGACTGTCAGAC 

35 TCTGAAGAGGTCCAGATGGCTCTGTTTATGCTATTTCTCCTCATATACCTAATTACTATGCT 

GGGGAATGTGGGGATGCTATTGATAATCCGCCTGGACCTCCAGCTTCACACTCCCATGTAT 

1 1 111 CCTTACTCACCTGTCATTTATTG ACCTCAGTTACTCAACTGTCGTCACACCTAAAAC 

CTT^CGAACTTACTGACTTCCAACTATATTTCCTTCACGGGCTGCTrTGCCCAGATGTTCT 

GTTTTGTCTTCTTGGGTACTGCTGAATGTTATCTTCTCTCCTCAATGGCCTATGATCGCTAT 

40 GCAGCGATCTGCAGTCCTCTACACTACACAGTTATTATGCCCAAAAGGCTCTGCCTCGCTC 

TCATCACTGGGCCTTATGTGATTGGCTTTATGGACTCCTTTGTCAATGTGGTTTCCATGAGC 

AGATTGCATTTCTGTGACTCAAACATAATTCATCACTTTTTCTGTGACACTTCCCCAATTTT 

AGCTCTGTCCTGCACTGACACAGACAACACTGAAATGCTGATATTCATTATCGCTGGTTCC 

ACCCTGATGGTGTCCCTTATCACAATATCTGCATCCTATGTGTCCATTCTCTCTACCATCCT 

45 GAAAATTAATTCCACrrCAGGAAAGCAGAAAGCTTTCTCTACTTGCGTCTCTCATCTCTTG 

GGAGTCACCATCTTCTATGGAACTATGATTTTTACTTACTTAAAGCCAAGAAAGTCTTATT 

CCrTGGGAAGAGATCAAGTGGCTCCTGTGTTTTATACTATTGTGATTCCCATGCTGAATCC 

ACTCATTTATAGTCTTAGAAACAGAGAAGTGAAAAATGCTCTCATTAGAGTCATGCAGAG 
AAGACAGGACTCCAGGTAG (SEQ ID NO: 392) 


AOLFR211 sequences: 

MMGRRhn^TTNVADFmMGLTL^FJEIQMAIJMIJF^ 

THLSFIDLSYSTWTPKTIANLLTS>mSFTGCFAQMFFFAFLGTAECYLLSSMAHDRYAAICSP 
LHYTVIMSKPJ.CLALITGPYVIGFIDSFWWSMSPXHFYDSNVIHHFFCDTSPILAL^ 
55 EILIFIWGSTLMVSLFnSASYVFILFITLKmSTSGKQK^ 

SYSLGRDQVASVFYTIVIPVLNPIJYSIJRNKEVKNAVIRVMQRRQDSR (SEQ ID NO: 393) 
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AAAGCAGTTGCTGTATTCTACACTATGATAACrTCTATGTTAAACCCCTTAATCTACACCTT 

ga^gaItgctca^tgaaaaatgccattaggaaattg^ 
TGTCAAATAA (SEQ ID NO: 384)

AOLFR207 sequences:

MFRTNDSTSTEFFLVGLSAHPKLQTVFFVLILWMYLMILLGNGVLISV1IFDSHLHTPMYFFLCN 

lsf^y^ 
s^ismt^^ 

10 Ssv^sgnediiealislfygvmtpmlnpliyslrn^ (SEQ ID 

NO: 385) 

ATGGAAAGGACCA^CGATTCCACGTCGACAGAATTrTTCCTGGTAGGGCTTTCTGCCCACC 

15 AATCGAGTCCTTATCTCAGTTATCATCITTGATTCT 
CCTCTGTAA^^ 

CCAGCTTTCTGGCAGTAAAGAAAAAGGTTTCCTTCTCTG 

^C^SroSATGGGGGCCACGGAGTGCATGATCTTAGGCACGATGGCACTGGACCGCTAT 

20 tggcagctgggtcctgggtcactgggcttgtggactcagtagtgcagacagct^ 
gcagttaccattctgtgctaataatg 

TGTTCTGGrTATrcCATTGTTAGTAA 

ctgttgaScaggtaatgaagacatcattgaggccct^^ 
gISScatgcttaatcctctcatctatagtctgcgaaa^ 

AAAAACATACTGTGTAGGAAAAACTTTTCTGATGGAAAATGA (SEQ ID NO: 386) 

AOLFR208 sequences:

MFPANWTSWWFFLGFFHYPKVQVIIFAVCLLMYLITLLGNIFLISITILDSHLHTPMYLFLSNL 

WLDIWYSSSA^ 

LVOTJMLVISVLLLPMPMLLICISYAFILASILW 
35 lS^QE^ (SEQ 
ID NO: 387) 

a Tr;TTrrrnGCAAATTGGACATCTGTAAAAGTATTTTTCTTCCTGGGA 1 1 ITTI CACTACCC 
C^AAGTTCAGCT'CATC 

40 a^^ctJa^ 

cctcagcaaVctctcctttctggacatctggtactcctot 

caaacittgttccagggagaaacact 
SSg^aT^ 

45 ttgcagctggctcctggatgacaggctgtctcactgccatggtggaaatgatgtctgt^ 

GCCACTGTCTCTCTGTGGTAATAGCATCATCAATCATTTCA 

?c^^oxatgccaatgctactca™ 
^gag^tcaotcagtggaaggtcgaa 
50 tggtggtagttttgttctatgggacgggtctctccatgcacctgaagc^ 

T'TCACAGGAAATAGACAAATTT^ 

otatcatc^ta^tct^gaacaaagaggtgaaag 

GAAATCATTTTAATACTGCCTTCATTTCCATCCTCAAATAA (SEQ ID NO: 388) 
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CACATTTTTGGTGCTACTGAGATCATCCTGCTGACAGTGATGGCCTGTGACTGCTATGTGG 

CCATCTGCAAACCTCTGAACTACACAACCATTATGAGCCACAGCCTGTGCATTCTCCTGGT 

GGCAGTGGCCTGGGTGGGAGGATTTCTTCATGCAACTATTCAGATTCTCTTTACAGTATGG 

CTGCCCTTCTGTGGCCCCAATGTCATAGGCCACTTCATGTGTGACTTGTACCCATTGTTAAA 

5 ACTTGTTTGCATAGACACTCATACCCTTGGTCTCTTTGTTGCTGTGAACAGTGGGriTATCT 

GCTTATTAAACTTCCTTATCTTGGTGGTATCCTATGTGATCATCnTGAGATCTTTAAAGAAC 

AATAGCTTGGAGGGGAGGTGTAAAGCCCTCTCCACCTGTATTTCTCACATCATAGTAGTTG 

TCTTATTCTTTGTGCCCTGTATATTTGTGTATCTGCGCTCAGTGACCACTCTGCCCATTGAT 

AAAGCTGTTGCTGTATTTTATACTATGGTGGTCCCAATGTTAAATCCCGTGGTCTACACAC 

TCAGAAATGCTGAGGTAAAAAGTGCAATAAGGAAGCTTTGGAGAAAAAAAGTGACTTCAG
ATAATGATTAA (SEQ ID NO: 380)






AOLFR205 sequences: 

MESENRTVIREFILLGLTQSQDIQLLVFVLVLffW 

DASYSFTVAPRMLVDFLSAKKJISYRGCITQLFFLHFLGGGEGLIXVVMAFDRYIAICRPLHYPT 
VMNPRTCYAMM1^LWLGGFVHSIIQWLILIU.PFCGPNQLDNFFCDWQVIKLACTDTFWEL 
LMVFNSGLMTLLCFLGLI^SYAVILCRIRGSSSEAK>nCAMSTCITHIIVIFFMFGPGIFIYTRPFRA 
FPADKWSLFHTVIFPLLNPVIYTLRNQEVKASMKKVFNKHIA (SEQ ID NO: 381) 

ATGGAAAGCGAGAACAGAACAGTGATAAGAGAATTCATCCTCCTTGGTCTGACCCAGTCT 

CAAGATATTCAGCTCCTGGTCTTTGTGCTAGTTTTAATATTCTACTTCATCATCCTCCCTGG 

AAATTTTCrCATTATTTTCACCATAAAGTCAGACCCTGGGCTCACAGCCCCCCTCTATTTCT 

TTCTGGGCAACTTGGCCTTCCTGGATGCATCCTACTCCTTCACTGTGGCTCCCCGGATGTTG 

GTGGACTrCCTCTCTGCGAAGAAGATAATCTCCTACAGAGGCTGCATCACTCAGCTCTTTT 

25 TCTTGCACrrTCCrrGGAGGAGGGGAGGGATTACTCCTTGTTGTGATGGCCTTTGACCGCTA 

CATCGCCATCTGCCGGCCTCTGCACTATCCTACTGTCATGAACCCTAGAACCTGCTATGCA 

ATGATGTTGGCTCTGTGGCrTGGGGGTTTTGTCCACTCCATTATCCAGGTGGTCCTCATCCT 

CCGCTTGCCTTTTTGTGGCCCAAACCAGCTGGACAACTTCTTCTGTGATGTCCCACAGGTC 

ATCAAGCTGGCCTGCACCGACACATTTGTGGTGGAGCTTCTGATGGTCTTCAACAGTGGCC 

TGATGACACTCCTGTGCTTTCTGGGGCTTCTGGCCTCCTATGCAGTCATTCTTTGTCGCATA 

CGAGGGTCTTCTTCTGAGGCAAAAAACAAGGCCATGTCCACGTGCATCAGGCATATCATTG 

TTATATTCTTCATGTTTGGACCTGGCATCTTC^TCTACACGCGCCCCnTCAGGGCTrTCCCA 

GCrGACAAGGTGGTTTCTCTCTTCCACACAGTGATTTTTCCTTTGTTGAATCCTGTCATrTA 

^CCCTTCGCAACCAGGAAGTGAAAGCTTCCATGAAAAAGGTGTTTAATAAGCACATAGC 
CTGA (SEQ ID NO: 382)

AOLFR206 sequences: 

MAhJRNNVTEFn-LGLTEWKMQKIIFVWSVIYTNAMIGNVLr^ 

DACYSSVmTKLITDSLYENKTILFNGCMTQVFGEHFFRGVEVU.LTVMAYDHW 
TlMKOHVCSLLVGVSWVfrOFT HATrnn vnmr vvi-rro-Kr'.m^inmt^r.-, ™.tt . «~^v^ — „ 






— - ' " " ^"uoi-j ci^ft.iiu'iAiuv-Miy vfuJK.iii't'KUVliVll^LTVMAYDHYVAICKPLHYT 
TIMKQHVCSLLVGVSWVGGFLHATIQILnCQLPFCGPNVmHFMCDLYTlJM^C^ 
IAANSGHCLLNCLLLLVSCVVILYSLKTHSLEAPJffiL\LS^ 

PIDKAVAVFYTMITSMLNPLIYTT.RNAQMKNAIRKLCSRXAISSVK (SEQ ID NO: 383)



ATGGCGAATAGAAACAATGTGACAGAGTTTATTCTATTGGGGCTTACAGAGAATCCAAAA 
45 ATGCAGAAAATCATATTTGTTGTGTTTTCTGTCATCTACATCAACGCCATGATAGGAAATG 
TGCrCATTGTGGTCACCATCACrGCCAGCCCATCACTGAGATCCCCCATGTACTTTTTCCTG 
GCCTATCTCrCCTTTATTGATGCCTGCTATTCCTCTGTCAATACCCCTAAGCTGATCACAGA 

CATTTTTTCAGAGGTGTTGAGGTCATCCTACTTACTGTAATGGCCTATGACCACTATGTGG 
50 CCATCTGCAAGCCCTTGCACTATACCACCATCATGAAGCAGCATGTTTGTAGCCTGCTAGT 

GGGA i^ TCATGGGTAGGAGGC ^ 

TTACCITTCrGTGGTCCTAATGTCATAGATCACrTTATGTGTGATCTCTACACTTTGATCAA 
TCTTGCCTGCACTAATACCCACACTCTAGGACTCTTCATTGCTGCCAACAGTGGGTTCATAT 
GCCTGTTAAACTGTCTCTTGCTCCTGGTCTCCTGCGTGGTCATACTGTACTCCTTAAAGACC 
55 CACAGCTTAGAGGCAAGGCATGAAGCCCTCTCTACCTGTGTCTCCCACATCACAGTTGTCA 
TCrrATCCTTTATACCCrGCATATTTGTGTACATGAGACCTCCAGCTACTTTACCCATTGAT 
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ETLNSDDLDATDKLOTFYRVMTPMMNPLIYSLRhlKDVKEA (SEQ ID NO: 

375) 

ATGGAATGGGAAAACCACACCATTCTGGTGGAATTTTTTCTGAAGGGACTTTCTGGTCACC 
5 CAAGACTTGAGTTACTCTTTTTTGTGCTCATCITCATAATGTATGTGGTCATCCTTCTGGGG 
AATGGTACTCTCATTTTAATCAGCATCTTGGACCCTCACCTTCACACCCCTATGTACTTCTT 
TCTGGGGAACCTCTCCnTCTTGGACATCTGCTACACCACCACCTCTATTCCCTCCACGCTAG 
TGAGCTTCCTTTCAGAAAGAAAGACCATTTCCCTTTCTGGCTGTGCAGTGCAGATGTTCCT 
CAGCTTGGCCATGGGGACAACAGAGTGTGTGCTTCTGGGCGTGATGGCCTTTGACCGCTAT 

10 GTGGCTATCTGCAACCCTCTGAGATATCCCATCATCATGAGTAAGGATGCCTATGTACCCA 
TGGCAGCTGGGTCCTGGATCATAGGAGCTGTCAATTCTGCAGTACAAACAGTGTTTGTGGT 
ACAATTGCCTTTCTGCAGGAATAACATCATCAATCATTTCACCrGTGAAATTCTAGCTGTC 
ATGAAACTGGCCTGTGCTGACATCTCAGGCAATGAGTTCATCCTGCTTGTGACCACAACAT 
TGTTCCTATTGACACCTTTGTTATTAATTATTGTCTCTTACACGTTAATCATTTTGAGCATC 

1 5 TTCAAjVATTAGCTCITCGGAGGGGAGAAGCAAACCTrCCTCTACCTGCTCAGCTCGTCTGA 
CTGTGGTGATAACATTCTGTGGGACCATCTTCCTCATGTACATGAAGCCCAAGTCTCAAGA 
GACACTTAATrCAGATGACTTGGATGCCACTGACAAACTTATATTCATATTCTACAGGGTG 
ATGACTCCCATGATGAATCCTTTAATCTACAGTCTTAGAAACAAGGATGTGAAGGAGGCA 
GTAAAACACCTACTGAGAAGAAAAAATTTTAACAAGTAA (SEQ ID NO: 376) 


AOLFR203 sequences: 

MKRQNQSCVVEFILLGFSNFPELQVQLFGVFLVIYVVTLMGNAIITVIISLNQSLHVPMYLFLLN
LSVVEVSFSAVITPEMLVVLSTEKTMISFVGCFAQMYFILLFGGTECFLLGAMAYDRFAAICHPL
NYPVIMNRGVFMKLVIFSWISGIMVATVQTTWVFSFPFCGPNEINHLFCETPPVLELVCADTFLF
EIYAFTGTILIVMVPFLLILLSYIRVLFAILKMPSTTGRQKAFSTCASHLTSVTLFYGTANMTYLQ
PKSGYSPETKKLISLAYTLLTPLLNPLIYSLRNSEMKRTLIKLWRRKVILHTF (SEQ ID NO: 377)

ATGAAAAGACAAAATCAAAGCTGTGTGGTTGAATTCATCCTCCTGGGCTTTTCTAACTTTC 
CTGAGCTCCAGGTGCAGCTCTTTGGGGTTTTCCTAGTTATTTATGTGGTGACCCTGATGGG 

30 AAATGCCATCATTACAGTCATCATCTCCTTAAACCAGAGCCTCCACGTTCCCATGTACCTGT 
TCCTCCTGAACCTATCTGTGGTGGAGGTGAGTrTCAGTGCAGTCATTACGCCTGAAATGCT 
GGTGGTGCTCTCTACTGAGAAAACTATGATTTCrrTTTGTGGGCTGTTTTGCACAGATGTAT 
ITCATCCTTCTTITTGGTGGGACTGAATGTTTTCTCCTGGGAGCGATGGCTTATGACCGATr 
TGCTGCAATTTGCCATCCTCTGAACTACCCAGTGATTATGAACAGAGGGGTTTTTATGAAA 

35 TTAGTAATATTCTCATGGATCTCAGGGATCATGGTGGCTACTGTGCAGACCACTTGGGTAT 
TTAGTTTTCCATTTTGTGGCCCCAATGAAATTAATCATCTCTTCTGTGAGACTCCCCCGGTA 
CTAGAGCTTGTGTGTGCAGACACCTTCTTATTTGAAATCTATGCCTTCACAGGCACCATTTT 
GATTGTrATGGTTCCTTTCTTGTrGATCCTCTTGTCTTACATTCGAGTTCTGTTTGCCATCCT 
GAAGATGCCATCAACTACTGGGAGACAAAAGGCCTTTTCCACCTGTGCCTCTCACCTCACA 

40 TCTGTGACCCTGTTCTATGGCACAGCCAATATGACTTATTTACAACCCAAATCTGGCTACTC 
ACCCGAAACCAAGAAACTGATCTCATTGGCTTACACGTTGCTTACCCCTCTGCTCAATCCG 
CTCATCTATAGCTTACGAAACAGTGAGATGAAGAGGACTTTGATAAAACTATGGCGAAGA 
AAAGTGATTTTACACACATTCTGA (SEQ ID NO: 378) 

AOLFR204 sequences:
MEKKKNVTEFILIGLTQNPIMEK\TTFVWLVLYM^ 

mTVYSSSSAPKLIXODSFQEKKnSFNGCMAQAYAEHIFGATEnLLTVMACDCYVAICKPLNYTT 
IMSHSLCILLVAVAWVGGFLHATIQILFTVWLPFCGPNVIGHFMCDLYPLIiCLVCIDTHTLGLFV 
AVNSGFICLLNFLILVVSY^ILRSLKNNSLEGRCKALSTCISHIIVVVIJFWCIFVYLRSV 
DKAVAWYTNTVVPMLNPVVYTLRNAEVKSAIRKLWRKKVTSDND (SEQ ID NO: 379)

ATGGAGAAGAAAAAGAATGTGACTGAATTCATITrAATAGGTCTTACACAGAACCCCATA 
ATGGAGAAAGTCACGTTTGTAGTATTTTTGGTTCTTTACATGATAACACITTCAGGCAACC 
TGCTCATTGTGGTTACCATTACCACCAGCCAGGCTCTGAGCTCCCCCATGTACTTCTTCCTG 
55 ACCCACCTTTCTTTGATAGACACAGTTTATTCTTCITCTTCAGCTCCTAAGTTGATTGTGG^ 
TTCCTITCAAGAGAAGAAAATCATCTCCTTTAATGGGTGTATGGCTCAAGCCTATGCAGAA

AOLFR200 sequences:
MTOJChTYTSLTEFVLLGL^JDTLELQIIL^ 

SFVDV(>JSTT1TPKMLADLLSEKXTISFAGCFLQMYFFISLATTC 
YSLIMSRTVYLKMAAGAFAAGLLNFMVNTSHVSSLSFCDSNV^ 
SISSII^GVhnVGTLLVILSSYSYVLFSIFSMHSGEGRHRAFSTCASm^^ 
SYSLNQDKVASWYTWIPMLNPLr/SLRSKJiVKKALANVlSRKRTSSFL (SEQ ID NO: 371) 

ATGACCAGAAAAAATTATACCTCACTGACTGAGTTCGTCCTATTGGGATTAGCAGACACGC 
TGGAGCrACAGATTATCCTCTTTTTGTTTTTTCTTGTGATTTATACAOT 

AATCTCGGGATGATCCTCTTAATCAGGATCGATTCCCAGCTTCACACACCCATGTATTTCTT 

CCTGGCTAACCTGTCCTTTGTGGACGTTTGTAACTCAACTACCATCACCCCAAAGATGCTG 

GCAGATTTATTATCAGAGAAGAAAACCATCTCITTTGCTGGCTGCTTCCTACAGATGTACT 

tctttatctccctggcgacaaccgaatgcatcctctttgggttaatggcctatgacaGgta 

TGCGGCCATATGTCGCCCGCTGCTTTACTCCTTGATCATGTCCAGGACCGTCTACCTAAAA 

atggcagccggggcttttgctgcagggttgctgaacttcatggtcaacacaagccatgtca 

GCAGCTTGTCATTCTGTGACTCCAATGTCATCCATCACITCTrCTGTGACAGTCCCCCACrT 

TTCAAGCTCTCTTGTTCTGACACAATCCTGAAAGAAAGCATAAGTTCTATTTTGGCTGGTG 

TGAATATTGTGGGGACTCTGCTTGTCATCCTCTCCTCCTACTCCTACGTTCTCTTCTCCATT 

TTTTCTATGCATTCGGGGGAGGGGAGGCACAGAGCTTTCTCCACGTGTGCCTCTCACCTGA 

CAGCCATAATTCTGTTCTATGCCACCTGCATCTATACTTACCTGAGACCTAGTTCCAGCTAC 

TCCCTGAATCAGGACAAAGTGGCTTCTGTGTTCTACACAGTGGTGATTCCCATGTTGAATC 

CTCTGATCTACAGCCTCAGGAGTAAGGAAGTAAAGAAGGCTTTAGCGAATGTAATTAGCA 

GGAAAAGGACCTCTTCCTTTCTGTGA (SEQ ID NO: 372) 

AOLFR201 sequences: 

NffiWENHTILVEFFLKGLSGHPPXELLFFVLIFIMYVVILLGNGTLILISILDPHLHTPMYFFLGNL 
SFLDICYTTTSIPSTLVSFI^ERKTISI^GCAVQMFLGLAMGTTECVLLGMMAFDRWAICNPLR 
YPIIMSKDAYWMAAGSWIIGAVNSAVQSVFWQLPFCRNNIINHFTCEILAVMnKJ^ 
EFIl^VATTLFILTPLLLIIVSYTLUVSira^ 

KETLNSDDLDATDKIISMFYGVMTPMMNPLIYSLRNKDVKEAVKHLLNRRFFSK (SEQ ID NO: 

ATGGAATGGGAAAACCACACCATTCTGGTGGAATTTTTTCTGAAGGGACTTTCTGGTCACC 
CAAGACTTGAGTTACTeTTTTTTC^ 

AATGGTACTCTCATTTTAATCAGCATCTTGGACCCTCACCTTCACACCCCTATGTACTTCTT 

TCTGGGGAACCTCTCCTTCTTGGACATCTGCTACACCACCACCTCTATTCCCTCCACGCTAG 

TGAGCITCCTTTCAGAAAGAAAGACCATTTCCCTTTCTGGCTGTGCAGTGCAGATGTTCCT 

CGGCTTGGCCATGGGGACAACAGAGTGTGTGCTTCTGGGCATGATGGCCTTTGACCGCTAT 

GTGGCTATCTGCAACCCTCTGAGATATCCCATCATCATGAGTAAGGATGCCTATGTACCCA 

TGGCAGCTGGGTCCTGGATCATAGGAGCTGTCAATTCTGCAGTACAATCAGTGTTTGTGGT 

ACAATTGCCTTTCTGCAGGAATAACATCATCAATCATTTCACCTGTGAAATTCTGGCTGTC 

ATGAAACTGGCCTGTGCTGACATCTCAGACAATGAGTTCATCATGCTTGTGGCCACAACAT 

TGTTCATATTGACACCTTTGTTATTAATCATTGTCTCTTACACGTTAATCATTGTGAGCATC 

TTCAAAATTAGCTCTTCCGAGGGGAGAAGCAAAGCTTCCTCTACCTGTTCAGCCCATCTGA 

CTGTGGTCATAATATTCTATGGGACCATCCTCTTCATGTACATGAAGCCCAAGTCTAAAGA 

GACACTTAATTCGGATGACTTGGATGCTACCGACAAAATTATATCCATGTTCTATGGGGTG 

ATGACTCCCATGATGAATCCnTAATCTACAGTCTTAGAAACAAGGATGTGAAAGAGGCA 

GTAAAACACCTACTGAACAGAAGGTTCTTTAGCAAGTGA (SEQ ID NO: 374) 

AOLFR202 sequences: 

MEWF^TILVEFFLKGLSGHPPJ.ELLFFVIJFIMYVVILLGNGTLILISnj^ 
SFLDICYTTTSIPSTLVSFLSERKTISLSGCAVQMFI^LAMGTTECV^ 

raM s*^ YVI, MAAGswn^ 

EFIIXVTTTLFLLTPll.LirVSYTLIII^IFKISSSEGRSKPSSTCSARLTVVIT^ 
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GCTGCCCTTCTGTGGGCATCGCCAAGTGGATCATTTCATCTGCGAGGTCCCTGTGCTCATC 
AAGCTGGCTTGTGTGGGCACCACGTTTAACGAGGCTGAGCTTT^ 

TCCTTMAGTGCCTGTCTCATTCATCCTGGTCTCCTCTGGCTACATTGCCCACGCAGTGTTG 
AGGATTAAGTCAGCTACCAGGAGACAGAAAGCATTCGGGACCTGCTTCTCCCACCTGACA 
S nTGGTCACCATCTTTTATGGAACCATCATCTTCATGTATCTGCAGCCAGCCAAGAGTAGAT 
CCAGGGACCAGGGCA^GTTTGTTTCTCT 

TCTTATTTATACCTTGAGGATCAAGGAGGTGAAAGGGGCATTAAAGAAAGTTCTAGCAAA 
GGCTCTGGGAGTAAATATTTTATGA (SEQ ID NO: 366) 

AOLFR198 sequences:

MENCTEVTKFILLGLTSVPELQIPLFILFTFIYLLTLCGNLGMMLLILMDSCLHTPMYFFLSNLSL 

^FGYSSAVTPKVMAGFLRGDKVISYNACAVQMFFFVALATVF^LLASMAYDRYAAVC^ 

LHY^MTASVGACLALGSYVCGFLNASFHIGGIFSl^FCKSNLVHHFFCDWAVMALSCSDKH 

TSEVILWMSSFNIFFVLLV1FISYLFIFITILKN4HSAKGHQKALSTCASHFTAVSW 

PSSSHSMDTDKMASVFYAMIIPMLNPWYSLRNREVQNAFKKVLRRQKFL (SEQ ID NO: 367)

ATGGAGAATTGTACGGAAGTGACAAAGTTCATTCTTCTAGGACTAACCAGTGTCCCAGAAC 
TACAGATCCCOCn'CTTTATCTrGTTCACCTTCATCT 
GGGATGATOTGCTGATCCTGATGGACT 
20 TAACCTGTCTCTGGTGGACTTTGGATACTCCT 
TTOCTTAGAGGAGACAAGGTCATCTC 
AGCOTGGCCACGGTGGAAAATTACTO^ 

GTGTCCAAACCCCTACACTACACCACCACCATGACGGCCAGTGTAGGTGCCTGTCTGGCCC 

?IS G CTCATATGTCTGTGCKnTC 
25 ^CTT^CTA^TCCAATCTGGTACATCACTTm 
GTOCTGCTCTGATAAACACACTAGTGAGGTC 

TFrWGTTCTTCTAGTTATCTTTATCTCCTACTTGTTCATATTCATC^ 
ATTCA^CT^GGACACCAAAAAGCATTGTCCACCTGTGCCrCTCACITC^ 

CGTCTCCTATGGGACAGTAATCTTCATCT 
30 ACAGACAAAA^GGCATCTGTGTTCTATGCTATGATCATCCCCATGCTGAACCCTGTGGTCT 

1c1gcc^ga\cagagaagtccagaatgcattcaagaaagtgttgagaaggcaaa^ 
TTCTATAA (SEQ ID NO: 368)

AOLFR199 sequences:

MDTGNKTLPQDFLLLGFPGSQTLQLSLFMLFLVMYILTVSGNVAILMLVSTSHQLHTO^LS 
nIsfLeWiTAAWKAI^ 

SygSssllsaqla^ 

TSIKDALDL1KAVHVLNTVVTPVLNPFIYTLRNKEVRETLLKJCWKGK (SEQ ID NO: 369) 



ATGGACACAGGCAACAAAACTCTGCCCCAGGACTTTCTCTTACTGGGCTTTCCTGGTTCTC 
AATCTGGCTAT^ 

TCTGAGCAACCTCrCCTTCCTGGAGATTTGGTATACCACAGCAGCAGTGCCC 

fire ATCCTACTGGGGAGAAGTCAGACCATATCATTTACAAGCTGTCTTTTGCAGATGTACT 

tot^^c^Xggct^ 

CTTGCCATCTGCTATCCTTTACACTACGGAGCCATCATGAGTAGCCTGCTCTCAGCGCAGC 
TGGCCCTGGGCTCCTGGGTGTGTGGTTTCGTGGCCATTGCAGTGCCCACAGCCCTCATCAG 

TGGCCTGTCCTTCTGTGGCCCCCGTGCCATCAACCACTT 

50 Sgccctggcctgcaccaacacacaggcagtagagcttgtggcctttgtgattgct^ 

GOTATCCTGAGTTCATGCCTCATCACCTTTGTCTCCTATGTGTACATCATCAGCA 
TCAGGATCCCCTCTGCCAGTGGCCGGAGCAAAGCCTTCTCCACGTGCTCCTCGCATCTC 
CGTGGTGCTCA'TTTGGTATGGGTCCACAGTTTTCCTrCACGTCCGC 
GCCTTGGATCTGATCAAAGCTGTCCACGTCCTGAACACTGTGGTGACTrc 
55 CC^^GATCTATACGCrrTCGTAATAAGGAAGTAAGAGAGACTCTGCTGAAGAAATGGAAGG 

GAAAATAA (SEQ ID NO: 370) 
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TCTCTGGATGTGCAGTGCAGATGTTCITTGGGTTrGCAATGGGGTCAACAGAATGTCTGCT 

TCITGGCATGATGGCATTTGATCGTTATGTGGCCATCTGCAACCCACTGAGATACCCCATC 

ATCCTGAGCAAGGTGGCGTATGTATTGATGGCTTCTGTGTCCTGGCTGTCCGGTGGAATAA 

ATTCAGCTGTGCAAACATTACTTGCCATGAGACTGCCTTTCTGTGGGAATAATATTATCiCC 

5 TCATTTCGCATGTGAAATATTAGCTGTCCTCAAGCTGGCCTGTGCTGATATATCCCTCAATA 

TrATCACCATGGTGATATCAAATATGGCCTTCCTGGTTCrrCCACTGATGGTCATITmC 

^™ ATG " CATCCTCTACACCAT <^GCAAATGAATTCAGCCACAGGAAGACGCAAGG 

CATTTTCCACGTGCTCAGCTCACCTGACTGTGGTGATCATATTTTACGGTACCATCTrcm 

1 n AT G^ A TGCGAAACCGAAGTCTCAAGACCTGATTGGGGAAGAAAAATTGCAAGCATTAGAC 

1 0 AAGCTCATTTCTCTGTTTTATGGGGTAGTGACACCCATGCTGAATCCTATACTCTATAGCTT 

GAGAAATAAGGATGTAAAAGCTGCTGTAAAATATTTGCTGAACAAAAAACCAATTCACTA 
A (SEQ ID NO: 362)






AOLFR196 sequences: 

MLESNYTMPTEFLFVGFTDYLPLRVTLFLWLLVTTLTMVGNILLIILVNINSSLO 
SFLDISCSTAITPKMLANFLASRKSrSPYGCALQMFFFASFADAECLILAAMAYDRYAAJCNPLL 
YTTLMSRRVCVCFIVLAWSGSTTSLVHVCLTFRLSFCGSNIVNHFFCDIPPLLALSCTDTOINOL 
LLFALCSFIQTSTFVVIFISYFCILITVI^IKSSGGRSKTFSTCASHL^ 

' on ^ S ^ T DKVVAWYTVWPMFNPIIYSFRNKDVKNALKja.LERIGYSNEV^hn^
(SEQ ID NO: 363)

ATGTTGGAGAGTAATTACACCATGCCAACTGAGTTCCTATTTGTTGGATTCACAGATTATC 

TACCTCTCAGAGTCACACTGTTCTTGGTATTCCTTCTGGTATATACATTAACTATGGTCGGA 

AATATACTCTTAATAATTCTAGTTAATATTAATTCAAGCCTTCAAATTCCCATGTATTATTT 

25 TCTT AGCAACTTATCTTTCTTAGACATCAGCTGTTCTACAGCAATCACTCCTAAAATGCTGG 
CAAACTTCTTGGCATCCAGGAAAAGCATCTCTCCTTATGGGTGTGCACTACAAATGTTTTT 
CTTCGCTTCTTTTGCTGATGCTGAGTGCCTTATCCTGGCAGCAATGGCTTATGACCGCTATG 
CAGCCATCTGCAACCCACTGCTCTATACTACACTGATGTCTAGGAGAGTCTGTGTCTGCTT 
CATTGTGTTGGCATATTTCAGTGGAAGTACAACATCACTGGTCCATGTGTGCCTCACATTC 

30 AGGCTGTCATTTTGTGGCTCCAATATCGTCAATCATTTTTTCTGTGATATCCCACCTCTTCT 
GGCTITATCATG'TACAGACACTCAGATCAACCAGCITCTGCTCTTTGCTTTGTGCAGCTTCA 
TCCAGACCAGCACTTTTGTGGTAATATTTATTTCTTACTTCTGCATCCTCATCACTGTGTTG 
AGCATCAAGTCCTCAGGTGGCAGAAGCAAAACATTCTCCACTTGTGCTTCCCACCTCATAG 
CAGTCACCTTATTCTATGGAGCGCTCCrGTTTATGTACTTACAGCCCACCACTAGCTATTCC 

35 CTAGACACTGATAAGGTGGTGGCAGTGTTTTATACTGTTGTATTTCCCATGTTTAATCCAA 
TAATTTATAGTTTCAGAAACAAGGATGTGAAAAATGCTCTCAAAAAGCTATTAGAAAGAA 
TTGGATATTCAAATGAATGGTATTTAAATCGTTTAAGAATAGTCAATATCTAA (SEQ ID NO: 

40 AOLFR197 sequences: 

MCYLSQLCLSLGEHTLHMGMVRHTNEShlLAGFILLGFSDYPQLQKVLFVXILILYLLmGNTn 
ILVSRLEPKLHMPMYFFLSHI^FLYRCFTSSVIPQLLVNLWEPMKTIAYGGCLVHLYNSHALGS 
TECVLLALMSOTRWAVCRPLHYTVLMHIHLCMALASMAWLSGIATTLVQSTLTLOLPFCGH 
RQVDHHCEWVLIKiACVGTTFNEAELFVASILFLIVPVSFILVSSGYIAJMVLRK 
45 ^TCFSHLTVVTIFYGTIIFMYLQPAKSRSRDQGKFVSLFYTVV^ 
VLAKALGVNIL (SEQ ID NO: 365) 

ATGTGTTATCTTTCTCAGCTATGCCTCAGCCTTGGGGAACACACTTTACATATGGGGATGG 
TGAGACATACCAATGAGAGCAACCTAGCAGGTTTCATCCTrTTAGGGTTTTCTGATTATCC 

50 TCAGTTACAGAAGGTTCTATTTGTGCTCATATTGATTCTGTATTTACTAACTATTTTGGGGA 
ATACCACCATCATTCTGGTTTCTCGTCTGGAACCCAAGCTTCATATGCCGATGTATTTCTTC 
CTTTCTCATCTCTCCTTCCTGTACCGCTGCTTCACCAGCAGTGTTATTCCCCAGCTCCTGGT 
AAACCTGTGGGAACCCATGAAAACTATCGCCTATGGTGGCTGTTTGGTTCACCTTTACAAC 
TCCCATGCCCTGGGATCCACTGAGTGCGTCCTCTTGGCTCTGATGTCCTGTGACCGCTATGT 

55 GGCTGTCTGCCGTCCTCTCCATTACACTGTCTTAATGCATATCCATCTCTGCATGGCCTTGG 
CATCTATGGCATGGCTCAGTGGAATAGCCACCACCCTGGTACAGTCCACCCTCACCCTGCA 
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ATGGAAAATAAGACAGAAGTAACACAATTCATTCTTCTAGGACTAACCAATGACTCAGAA 
CTGCAGGTTCCCCTCITTATAACGTTCCCCTTCATCTATATTATCACTCTGGTTGGAAACCT 
GGGAATTATTGTATTGATATTCTGGGATTCCTGTCTCCACAATCCCATGTACTTTTTTCTCA 
GTAACTTGTCTCTAGTGGACTTTTGCTACTCTTCAGCTGTCACTCCCATCGTCATGGCTGGA 
5 TTCCTTATAGAAGACAAGGTCATCTCTTACAATGCATGTGCTGCTCAAATGTATATCTTTGT 
AGCITTTGCCACTGTGGAAAATTACCTCTTGGCCTCAATGGCCTATGACCGCTATGCAGCA 
GTGTGCAAACCCCTACATrACACCACAACCATGACAACAACTGTGTGTGCTCGTCTGGCCA 
TAGGCTCCTACCTCTGTGGTTTCCTGAATGCCTCCATCCACACTGGGGACACATTTAGTCTC 
TCTTTCTGTAAGTCCAATGAAGTCCATCACTTTTTCTGTGATATTCCAGCAGTCATGGTTCT 

1 0 CTCTTGCTCrGATAGACATATTAGCGAGCTTGTTCTTATTTATGTTGTGAGCTTCAATATCT 
TrATAGCTCTCCTGGrrATCTTGATATCCrACACATTCATTTTTATCACCATCCTAAAGATG 
CACTCAGCTTCAGTATACCAGAAGCCTTTGTCCACCTGTGCCTCTCATTTCATTGCAGTCGG 
CATCTTCTATGGGACTATTATCTTCATGTACTTACAACCCAGCTCCAGTCACTCCATGGACA 
CAGACAAAATGGCACCTGTGTTCTATACAATGGTCATCCCCATGCTGAACCCTCTGGTCTA 

1 5 TAGTCTGAGGAACAAGGAAGTGAAGAGTGCATTCAAGAAAGTTGTTGAGAAGGCAAAATT 
GTCTGTAGGATGGTCAGTTTAA (SEQ ID NO: 358) 

AOLFR194 sequences: 

MERQNQSCVVEFILLGFSNYPELQGQLFVAFLVIYLVTLIGNAIIIVIVSLDQSLHVPMYLFLLNL
SVVDLSFSAVIMPEMLVVLSTEKTTISFGGCFAQMYFILLFGGAECFLLGAMAYDRFAAICHPL
NYQMIMNKGVFMKLIIFSWALGFMLGTVQTSWVSSFPFCGLNEINHISCETPAVLELACADTFL
FEIYAFTGTFLIILVPFLLILLSYIRVLFAILKMPSTTGRQKAFSTCAAHLTSVTLFYGTASMTYLQ
PKSGYSPETKKVMSLSYSLLTPLLNLLIYSLRNSEMKRALMKLWRRRVVLHTI (SEQ ID NO: 359)


ATGGAAAGACAAAATCAAAGCTGTGTGGTTGAATTCATCCTCTTGGGCTTTTCTAACTATC 
CTGAGCTCCAGGGGCAGCTCTTTGTGGCTrTCCTGGTTATTTATCTGGTGACCCTG^ATAGG 
AAATGCCATTATTATAGTCATCGTCTCCCTAGACCAGAGCCTCCACGTTCCCATGTACCTGT 
TTCrCCTGAACTTATCTGTGGTGGACCTGAGTTTCAGTGCAGTTATTATGCCTGAAATGCT 

GGTGGTCCTCTCTACTGAAAAAACTACAATTTCTTTTGGGGGCTGTTTTGCACAGATGTAT
TTCATCCTTCTTTITGGTGGGGCTGAATGTTTTCTTCTGGGAGCAATGGCTTATGACCGATT 
TGCTGCAATTTGCCATCCTCTCAACTACCAAATGATTATGAATAAAGGAGTTTTTATGAAA 
TTAATTATATTTTCATGGGCCnTAGGTTTTATGTTAGGTACTGTTCAAACATCATGGGTATC 
TAGTTTTCCCITrTGTGGCCTTAATGAAATTAACCATATATCTTGTGAAACCCCAGCAGTGT 

TAGAACrrGCATGTGCAGACACGTTTTTGTTrGAAATCTATGCATTCACAGGCACCTTmG
ATTATTTTGGTTCCTTTCTTGTTGATACTCnrTGTCnTACATTCGAGTTCT 
AAGATGCCATCAACCACTGGGAGACAAAAGGCCTTTTCCACCTGTGCCGCTCACCTCACAT 
CTGTGACCCTATTCTATGGCACAGCCAGTATGACTTATTTACAACCCAAATCrcG^^ACl^ 

ACCGGAAACCT^GAAAGTGATGTCATTGTCT™ 
CTTATCTACAGTTTGCGAAATAGTGAGATGAAGAGGGCTTTGATGAAATTATGGCGAAGG

CGAGTGGTTTTACACACAATCTGA (SEQ ID NO: 360) 
AOLFR195 sequences:

MIVOLICTVCFLAVNTFHWSSFDFLKADDMGEINQTLVSEFLLLGLSGYPKIErSATALILVMY 

LVIOGNGVLIIASIFDSHFHTPMWFLGNLSFLDICYTSSSWSTLVSLISKKRNISFSGCAVQMFF 

GFAMGSTECLLLGMMAFDRYVAICKPLRYPIILSKVAYVLMASVSWLSGGINSAVQTLLAMRL 

PFCGNNIINHFACEILA^KLACADISLNIITMVISNMAFLVLPLMVIFFSYMFILYTILQMNSATG 

RRKAFSTCSAHLTWIIFYGTIFmYAKPKSQDLIGEEKLQALDKLISLFYGWTPMLNPILYSLR 

NKDVKAAVKYLLNKKPIH (SEQ ID NO: 361) 



ATGATTGTTCAGTTAATITGTACTGTTTGTTTCTTGGCAGTAAATACATTTCATGTTAGATC 
TTCTTTTGATTTCCTGAAAGCAGATGACATGGGTGAGATTAACCAGACACTTGTGTCAGAA. 

TTTCTTCTTCnX3GGTCTTTCTGGATACCCAA 

AGTTATGTACCTAGTGATTCTAATTGGCAATGGTGTTCTAATCATAGCCAGCATCTTOA'TT 

CTCATTTTCACACACCAATGTACTC

ACATCCTCCTCTGTTCCCTCAACATTGGTGAGCrTAATCTCAAAGAAAAGAAACATTTCCI 
MTGGGNnTEITYFILLGFSDFPRJIKVLFTIFLVIYITSL^

FmVCYlSSTVPKMLS^LQEQQTITFVGCIIQYFIFSTMGI^ESCLMTAMAYDRYAAICNPL^ 
S1MSPTLCVWMVLGAYMTGLTASIJQIGALLQLHFCGSNVIRHFF<^MPQ 

MTAILTMFFGMSALVIMISYGYIGISDVIXITSAKGSPKAFOTCASHLTAVSI^^ 
GGSSSFDPJASVFYTWIPMLNPLITSLRNKEIKDALKRLQKRKCC (SEQ ID NO: 353) 

ATGACTGGGGGAGGAAATATTACAGAAATCACCTATTTCATCCTGCTGGGATTCTCAGATT 

TTCCCAGGATCATAAAAGTGCTCTTCACTATATTCCTGGTGATCTACATTACATCTCTGGCC 

TGGAACCTCTCCCTCATTGTTTTAATAAGGATGGATTCCCACCTCCATACACCCATGTATTT 

CTTCCTCAGTAACCTGTCCTTCATAGATGTCTGCTATATCAGCTCCACAGTCCCCAAGATGC 

TCTCCAACCTCTTACAGGAACAGCAAACTATCACTTTTGTTGGTTGTATTATTCAGTACTTT 

ATCTTTTCAACGATGGGACTGAGTGAGTCTTGTCTCATGACAGCCATGGCTTATGATCG-TT 

ATGCTGCCATITGTAACCCCCTGCTCrATTCATCCATCATGTCACCCACCCrCTGTGriTGG 

ATGGTACTGGGAGCCTACATGACTGGCCTCACrGCTTCTTTATTCCAAATTGGTGCTTrGCT 

TCAACTCCACTTCTGTGGGTCTAATGTCATCAGACATTTC1TCTGTGACATGCCCCAACTGT 

TAAT<m-GTCCTGTACTGACACTTTCTTTGTACAGGTCATGACTGCTATATTAACCATGTTC 

TTTGGGATAGCAAGTGCCCTAGTTATCATGATATCCTATGGCTATATTGGCATCTCCATCA 

TGAAGATCACTTCAGCTAAAGGCAGTCCAAAGGCATTCAACACCTGTGCrTCTCATCTAAC 

AGCTGTTTCCCTCTTCTATACATCAGGAATCrTTGTCTATTTGAGGTCCAGCT 

CTTCAAGCITTGACAGATTTGCATCTGTTTT^ 

TTGATTTACAGTTTGAGGAACAAAGAAATTAAAGATGCCTTAAAGAGGTTGCAAAAGAGA 
AAGTGCTGCTGA (SEQ ID NO: 354)



AOLFR192 sequences: 



MENNTEVTEFILVGLTDDPELQIPLFIVFLFI^ 

DFGYSSAVTPKVMVGFLTGDKF1LYNACATQFFFFVAFITAESFLLASMAYDRYAALCXPLHY 
111 MTTNVCACLAIGSY1CGFLNASIHTGNTFRLSFCRSNVVEHFFCDAPPLLTLSCSDNYISEM 
VIEFVVGFNDLFSILYILISYLFinTlMKMRSPEGRQKAFSTCASHLTAVSIFYGTGIF 
HFMGTDKMASWYAIVIPMLNPLVYSLRNKEVKSAFKXTVGKAKASIGFIF (SEQ ID NO: 355)



ATGGAGAACAACACAGAGGTGACTGAATTCATCCTTGTGGGGTTAACTGATGACCCAGAA 

CTGCAGATCCCACTCTTCATAGTCTTCCTTTTCATCTACCTCATCACTCTGGTTGGGAAC 

GGGGATGATTGAATrGATTCTACTGGACTCCrGTCTCCACACCCCCATGTACTTCTTCCTCA 

GTAACCTCTCCCTGGTGGACTTTGGTTATTCCTCAGCTGTCACTCCCAAGGTGATGGTGGG 

GTTTCTCACAGGAGACAAATTCATATTATATAATGCTTGTGCCACACAATTCTTCTTCTTTG 

TAGCCrTTATCACTGCAGAAAGTTTCCTCCTGGCATCAATGGCCTATGACCGCTATGCAGC 

ATTGTGTAAACCCCTGCATTACACCACCACCATGACAACAAATGTATGTGCTTGCCTGGCC 

ATAGGCTCCTACATCTGTGGTTTCCTGAATGCATCCATTCATACTGGGAACACTTTCAGGC 

TCTCCnTCTGTAGATCCAATGTAGTTGAACACTTTTTCTGTGATGCTCCTCCTCT 

CTCT Si!?^!ir rcAGAC ^ CTA ^^ 

ACCTCTTTrCTATCCTGGTAATCTTGATCTCCTACTTATTTATATTTATCACCA 

ATGCGCTC\CCTGAAGGACGCCAGAAGGCCTrTTCTACTTGTGCTTCCCACCTTACT 

TTTCCATCTT1TATGGGACAGGAATCTTTATGTACTTACGACCT 

GGCACAGACAAj^TGGCATCTGTGTTCTATGCCATAGTCATTCCCATGTTGAATCCACTGG 

T ^™ GCCTGAGGAACAAAGAGG ™^^ 
AGGCCTCTATAGGATTCATATTTTAA (SEQ ID NO: 356) 

AOLFR193 sequences: 

MENKTEVTQFIl.LGLTNDSELQWLnTFTFrynTLVGNLGirV^IFWDSCLHN^ 

DFCTSSAVTPr^AGFLIEDK\aSYNACAAQMYIF^AFATVENYLLASMAYDRYAAVCKPLHY 
TTTMTTTVCARLAIGSYTXXjFLNASMTGDTFSI^FCK^ 

VIJYWSFNIFIALLVIEISYTFIHTILKMHSASWQKP^ 

SMDTDKMAP Vl^TMVIPMLNPLVYSLRNKii VKS AFT^ (SEQ ID NO- 
AOLFR190 sequences:

====== 

YSLKRDKMVSTFYTMLFPMLNPMIYSLRSKDMKDALKKFFKSA (SEQ ID NO: 351) 

ATGCAGAGGAGCAATCACACAGTGACTGAGTTCATCCTGCT 
GGATGCAACTG^GGCCTCTTTGTGGTGTTCCTGGGTGTGTACTGTCTGACTGTC 

i o ?agScc^atcgtc™^ 

^ggaaatctgtcai^ctggatctctggtattcttct 

a^ct^Sctgaagacaaaagca 

cSgctggcct^^ 
a^ctccaagc^ 

ATA^CC^ATACTG^GGGGTTTTGTCAATGCAATAATATTAA
GATTTTTGTGGTGACAATGTCATTGATGACTTTT^ 

G^CA^^TG^ 
ATCTCCGCTACTGTC 

20 cC^ScTATGGCTC^ 

caSctg^agtaaagacatgaaagacgctct^ 

(SEQ ID NO: 352) 
AOLFR188 sequences:

MFPSLCPCVLLVQLPL^T^QCFWCSCDSLLR^ 

ELLCVRPWSGKTWSIPJIHIFDMELLTNNLK^ 

GLTQWEGQKXOJVTFlJJYMVTIMGhaXIIVTIMASQSL^ 

MI\nDLI^EKXTISFQGCMAQLFMDHLFAGAEVn,LVVMAYDRYTyLAJCKPLHELIT^ 

MLLAAWlGGFLHSLVQFIJIYQLPFCGPNVIDNFLCDLYPLLKIACrTOTY^ 

AVTFFTILLSYGVILHSLKTQSLEGKPJ<AFYTCASHVT\^L^ 

VLTFITPMLNPLIYTLKNAEMKSAMRKLWSKKVSLAGKWLYHS (SEQ ID NO: 347) 

ATGTTCCCCTCCCTGTGTCCATGTGTTCTCCTTGTTCAACTCCCACTTATGAATGAGAACAT 

GCAGTGTTTTGTTTTCTGTTCITGTGATAGTTTGCTGAGAATGATGGTITCCCGCITC^ 

ATGTCCCATTTGTAAAAATGAAAAGGATAATTGTGGGAGGATATTCTAAACACTTCTrrrc 

TAATGAGCTGCTCTGTGTGAGGCCCTGGTCAGGGAAAACGTGGTCGATAAGGCATCACAT 

TTTTGACATGGAGCTTCTGACAAATAATCTCAAATTT^ 

TCCGACACCTGAGTCCAACACCTTCAGAAGAACACATGAAAAATAAGAACAATGTGACTG 
AATTTATCCTCTTAGGGCTCACACAGAACCCTGAGGGGCAAAAGGTTTTATTTGTCACATT 
OTA^CTAATCTACATGGTGACGATAATGGGCAACCTGCTTATCATAGTGACCATCATGGCC 

CTATTCTACTGCATTTGCTCCCAAAATGATTGTTGAfm'GCTCTCTGAGAAAAAGACCATTT 

CCTTTCAGGGTTGTATGGCTCAACITTTTATGGATCATTTATTTGCTGGTGCTGAAG^^ 

CirCTGGTGGTAATGGCCTATGATCGATACATGGCCATCTGTAAGCCTCTTCATGAATTGA 

TCACCATGAATCGTCGAGTCTGTGTTCTTATGCTGTTGGCGGCCTGGATTGGAGGCTrTCT 

TCACTCATTGGTTCAATTTCTCTTTATTTATCAGCTCCCTTTCTGTGGACCCAATGTCA'^G 

ACAACTTCCTGTGTGATTTGTATCCCTTATTGAAACTTGCrTGCACCAATACCTATGTCACT 

GGGCTTTCTATGATAGCTAATGGAGGAGCGATTTGTGCTGTCACCTTCTTCACTATCCTGC 

TTTCTACACCTGTGCATCCCACGTCACTGTGGTCATTTTATTCTITGTCCCCrGTATCnTCTr 
GTATGCAAGGCCCAATTCTACTTTTCCCATTGATAAATCCATGACTGTAGTTCTAACTTTTA 
TAACTCCCATGCTGAACCCACTAATCTATACCCTGAAGAATGCAGAAATGAAAAGTGCCAT 
GAGGAAACTTTGGAGTAAAAAAGTAAGCTTAGCTGGGAAATGGCTGTATCACTCATGA 



(SEQ ID NO: 348) 
AOLFR189 sequences: 



MQQNNSWEFILLGLTQDPLRQKIVFVIFLIFYMGTWGNMLirVTIKSSRTLGSPM^ 

ADSCFSTSTAPRLIVDAl^EKKnTYNECMTQWALHLFGCMEi^ 

T^SQQVCHLIVLAWIGSLmSTAQIILAL^^ 

VSNSGAICSSSFMILUSYIV]l.HSLRNHSAXGKKXAl^ACTSHIIWILFFGPCIFIYTRPPTrFPNm 
KMVAVFYT1GTPFLNPLIYTSEECRSEKCHEK (SEQ ID NO: 349)



a I gc 1 gcaa ^aac^^ 

ggcagaaaatagtgtttgtaatcttcntaattttctatatgggaactgtggtggggaatat 
gctcattattgtgaccatcaagtccagccggacactaggaagccccatgtacttctttcta 

TjrTATTTGTCCTTTGCAGATTCTTGCTTTTCAACTTCCACAGCCCCTAGATTAATTGTGGA 

TGCTCTCTCTGAAAAGAAAATTATAACCTACAATGAGTGCATGACACAAGTCTTTGCACTA 

CATTTATTTGGCTGCATGGAGATCTTTGTCCTCATTCTCATGGCTGTTGATCGCTATGTGGC 

^^ T ^ GCC ^ GCG ^CCCAACCATCATGAGCCAGCAGGT(^GCATCATCarGATT 
GTTCTTGCCTGXjATAGGGTCTTTAATACACTCTACAGCTCAOATTA TfTTRWTTTA A/5ATT 



GA Z C I GTAAGCCOTGCG ^ ACCC AACCATCATGAGCCAGCAGGTCrGCATCATCCT^ 
G I^ GGC I GGATAG ^^ 
GCCTTTCTGTGGACCCTATTTGATTGATCATTATTGCTGTGATTTGCAGCCCTTGTTGAAAC 
TTGCCTGCATGGACACTTACATGATCAACCTGCTGTTGGTGTCTAACAGTGGGG( 
CTCAAGTAGTTTCATGATTTTGATAATTTCATATATTGTCATCTTGCATTCACTGy 
ACAGTGCCAAAGGGAAGAAAAAGGCTCTCTCCGCTTGCACGTCTCACATAATTG 
CTTATTCTTTGGCCCATGTATATTCATATATACACGCCCCCCGACCACTTTCCCC^ 
AGATGGTGGCAGTATTTTATACTATTGGAACACCCrTTCTCAATCCACTCATCTA 1 
GAGGAATGCAGAAGTGAAAAATGCCATGAGAAAG (SEQ ID NO: 350)
GATGTGCCCCGAGTTGTACACATCCTGTTTGCTAATCTCTATCTACTGATACCTCCCATGCT
CAACCCCATCATTTATGGAGTTAGAACCAAACAGATCGGGGACAGGGTTATCCAAGGATG 

TTGTGGAAACATCCCCTGA (SEQ ID NO: 342) 

AOLFR186 sequences:

MSNASLVTAPILTGLPHAPGLDALLFGIFLVVYVLTVLGNLLILLVmVDSHLHTPMYYFLTOLS 

FIDMWFSTVTWKMLMTLVSPSGRAISFHSCVAQLYFFHFLGSTECFLYTVMSYDRYLAIS^ 

RYTSMMSGSRCALLATGTWLSGSLHSAVQTILTFHLPYCGPNQIQHYFCDAPP1LKLACADTSA 

N^F^IG^^^ 

PGSMDAMDGWAII^TVLTPLLNPVWTLRNKEVKKAVLKLRDKVAHPQRK (SEQ ID NO: 343)

ATGTCCAACGCCAGCCTCGTGACAGCATTCATCCTCACAGGCCTTCCCCATGCCCCAGGGC 

togIcg^ctcctcVttggaatcttcctggtgg™^ 
i 5 cctcatcctgctggtgatcagggtggattctcacctccacacccccatgtactacttcctca 
ccaacctgtccttcattgacatgtggttctccactgtcacggtgccgaaaatgctgatgac 
c^gtctccccaagcggcagggctatctccttccacagctgcgtggctcag 
tccacttcctggggagcaccgagtgtttcctctacacagtcatgtcctatgatcgctactt^ 

gccatcagWccgctcaggtacaccagc^^^ 
20 ccaccggcacttggctcagtggctctctgcactctgctgtccagaccata™^ 

Sgccctactgtggacccaaccagatccagcact^ 

Iactggcctgtgcagacacctcagccaacgtc^^ 
ggcctcaggctgctttgtcctgatagtgctgtcctatgtgtccatcgtct^ 

ggatccgcacctcagatgggaggcgcagagcctttcagacct^^ 
ggtcc™cttctttgttccctgtgttgtcatttatctgaggccaggctccatggatgcca 

tggatggag^gtggccattttctacact 
accctgagaaacaaggaggtc^ 
CCTCAGAGGAAATAA (SEQ ID NO: 344)
AOLFR187 sequences:


GSK (SEQ ID NO: 345) 
ATGGCACAGGTGAGGGCGCTGCATAAAATCATGGCCCrTTTTTTCTGCTAACAGCATAGGTG
CTATGAACAACTCTGACACTCGCATAGCAGGCTGCTTCCTCACTGGC 

GCAACTACATATGTGGCTGTGCATCCCCTTCTGCATCATGTACATCGCTGCCCTGGAAGGC 
AATGGX^TGCTAATTTCT 

GCCAATrrTTOTGGCTAGGTTATAGCCACATTTCCTTTGATGGCT 

SaTctgScSc^ 

CACTGCCACCCTGAGCCGCAGCrTCATCATTATGTTTCCATCCATCTTTCTCCTTGAGCACC 
TGCACTATTGCCAGATCAACATCATTGCACACACATTTTGTGAGCACATGGGCATTGCCCA 
TCTGTCCTGTTCTGATATCTCCATCAATGTCTGGTATGGGTTGGCAGCTGCTCTT(^CCA 

CAGGCCTGGACATCATGCTTATTACTGTTTCCTA 

CT^CTC^G^ 

rr^ArTrTTCTATGTCCCTGCCCTTTTTTCTGTCTTTGCCTACAGGTTTGGTGGGAGAAGCA 
TCCCATGCTATGTCCATATTCTCCTGGCCAGCCTCT 

CCCGTTATTTATGGAGTGAGGACTAAGCCAATACTGGAAGGGGCTAAGCAGATGTTTTCA 
AATCTTGCCAAAGGATCTAAATAA (SEQ ID NO: 346) 
ATGTCTACCGTGCTCCTCTTC™^

TTCGGAGCATTTATCACCAGTAGTGCACATGGTCATGGCTGATATATACCTACTGCTCCCG 
CCTGTGCTAAACCCCATTGTCTACAGTGTGAAGACCAAGCAAATTTGA (SEQ ID NO: 338) 

AOLFR184 sequences:

MSTLPTQIAPNSSTSMAPTFLLVGMPGLSGAPSWWTLPLIAVYLLSALGNGTILWI1ALOPALHR 

RALAICRPLHYPALLTNGV1SKJSLAISFRCLGLHLPLPFLLAYMPYCLPQVLTOSYCLHPDVART. 

ACPEAWGAAYSLFVVl^AMGLDPLLIITSYGLIGKVLQGVESREDRWKAGQTCAAHLSAVLLF 

™MILLALINHPELPITQHTHTLLSYVHFLLPPLINPILYSVKM^ 

(SEQ ID NO: 339) 

ATGTCAACATTACCAACTCAGATAGCCCCCAATAGCAGCACTTCAATGGCCCCCACCTTCT 

TGCTGGTGGGCATGCCAGGCCTATCAGGTGCACCCTCCTGGTGGACATTGCCCCTCATTGC 

TGTCTACCTTCTCTCTGCACTGGGAAATGGCACCATCCTCTGGATCATTGCCCTGCAGCCC 

GCCCTGCACCGCCCAATGCACTTCTTCCTCTTCrrGCTTAGTGTGTCTGATATTGGATTGGT 

CACTGCCCTGATGCCCACACTGCTGGGCATCGCCCTTGCTGGTGCTCACACTGTCCCTGCC 

TCAGCCTGCCTTCTACAGATGGTTTTTATCCATGTCTTTTCTGTCATGGAGTCCTCTGTCTT 

GCTCGCCATGTCCATTGATCGGGCACTGGCCATCTGCCGACCTCTCCACTACCCAGCGCTC 

CTCACCAATGGTGTAATTAGCAAAATCAGCCTGGCCATTTCITTTCGATGCCTGGGTCTCC 

ATCTGCCCCTGCCATTCCTGCTGGCCTACATGCCCTACTGCCTCCCACAGGTCCTAACCCAT 

TCTTATTGCITGCATCCAGATGTGGCTCGTTTGGCCTGCCCAGAAGCTTGGGGTGCAGCCT 

ACAGCCTATTTGTGGTTCTTTCAGCCATGGGTTTGGACCCCCrGCTrATTTTCTTCTCCTAT 

GGCCTGATTGGCAAGGTGTTGCAAGGTGTGGAGTCCAGAGAGGATCGCTGGAAGGCTGGT 

CAAACCTGTGCTGCCCACCrCrCTGCAGTGCTCCrCTTCTATATCCCTATGATCCTCCTGGC 

ACTGATTAACCATCCTGAGCTGCCAATCACTCAGCATACCCATACTCTTCTATCCTATGTCC 

A^CCTTCTTCCTCCATTGATAAAeCCTATTCTCTATAGTGTCAAGATGAAGGAGATTAGA 

AAGAGAATACTCAACAGGTTGCAGCCCAGGAAGGTGGGTGGTGCTCAGTGA (SEQ ID NO: 



MFYPILhTOISTKNNSMMSCCMLFIKTVEnLVYNQTQSPWYPr^ 

VDCWSRDHINQSMVLASGNSSSHPVSFILLGIPGLESFQLWMJFPFCATYAVAVVGMTLLHVIR 
IDHTLHEPMYLFLAMLAITDLVLSSSTQPKMLAIFWFHAHEIQYHACLIQVFFIHAFS 
MAMAI^CYVATCFPLPOISSILTPSVVIKLGTTVMLRGLLWVSPFCFMVSRMPFCOHOAIPOSYC 
EHMAVLKLVCADTSISRGYGIJFVAFSVAGFDMIVIGMSYVMn.RAVLQLPSGEARL^sW 

ATGTTCTACCCCATTTTGAATGACATAAGTACGAAAAACAACAGTAACATCATGTCATGTT 

GTAACATATTATTTATTAAAACAGTTGAAATTATTCTAGTTTATAATCAAACCCAATCACC 

CTGGTATCCAATAGTCCCATCCAAAAGCCTTGTATATAATAATAACACTTGTTTTGATTGTT 

ATCATCTGCAGAGAGTAGATTGCGTTCCCAGCAGAGACCATATTAACCAGTCCATGGTGCT 

GGCTTCAGGGAACAGCTCTTCTCATCCTGTGTCCTTCATCCTGCTTGGAATCCCAGGCCTG 

GAGAGTTTCCAGTTGTGGATTGCCTTTCCGTTCTGTGCCACGTATGCTGTGGCTGTTGTTGG 

AAATATCACTCTCCTCCATGTAATCAGAATTGACCACACCCTGCATGAGCCCATGTACCTC 

rrTCTGGCCATGCrGGCCATCACTGACCTGGTCCTCTCCTCCTCCACTCAACCTAAGATGTT 

GGCCATATTCTGGTTTCATGCTCATGAGATTCAGTACCATGCCTGCCTCATCCAGGTGTTCT 

TCATCCATGCCTTTTCrTCTGTGGAGTCTGGGGTGCTCATGGCTATGGCCCTGGACTGCTAC 

GTGGCTACCTGCTTCCCACTCCGACACTCTAGCATCCTGACCCCATCGGTCGTGATCAAAC 

TGGGGACCATCGTGATGCTGAGAGGGCTGCTGTGGGTGAGCCCCTTCTGCTTCATGGTGTC 

TAGGATGCCCTTCTGCCAACACCAAGCCATTCCCCAGTCATACTGTGAGCACATGGCTGTG 

CTGAAGTTGGTGTGTGCTGATACAAGCATAAGTCGTGGGTATGGGCTCTTTGTGGCCTTCT 

CTGTGGCrGGCTTTGATATGATTGTCATTGGTATGTCATACGTGATGATTTTGAGAGCTGT 

GCTrCAGTTGCCCTCAGGTGAAGCCCGCCTCAAAGCTTTTAGCACACGTGCCTCCCATATC 

TGTGTCATCTTGGCTCTTTATATCCCAGCCC1 i 1 1 I I C I T 1 CCTCACCTACCGCTrTGGCCAT 
GGGTCTTCTTGTTCAATGCCATGGGAATTTCACCTAATGCCTGCTTTGCTCAAG^TTCTTC

ATTCATGGA^CACTGTCATGGAATC 

TGCCATTTCACAATCCCTTAAGATACAGTTC 

GGACnT^TTTTAGCCATT^ 

5 aitaaaatattgtca^ 
a^Iggcct^ 

ATGCTGGACTOGCACTC^^ 

TGCATCTrroGCAGAGAGGCTTAAGGCCCTAAATACCT 

CTCACCTTCTATGTGCCCATCATCACCCTGGCTGCCATGCATCACTTTGCCAAGCACAAAA 
GCCCTCTTGTCGTGATCCTTATTGCAGAT^

aSgtg^ctct^ 
GTATGTGGGAGATAA (SEQ ID NO: 334)

AOLFR182 sequences: 

MTLGSLGNSSSSVSATFLLSGIPGLERMHIWISIPLCFMYLVSIPGNCTILFIIKTERSLHEPMV^L

^mlalidTgls^ 

LHYVSILT^TVK^RPGLVS 

A^G^VIVSTVGmSLLILFSYALILRTVLS^^ 

IHRFGKQAPHLVQVVMGFMYLLFPPVMNPr^ (SEQ ID NO: 335) 


ATGACCCTGGGATCCCTGGGAAACAGCAGCAGCAGCGTTTCTGCTACCTTCCTGCTGAGTC 

gStccctgggctggagcgcatgcacatctggatctccatcccactgtg 
ga^atgtatctc^ctgtccatgctgg 

25 TCTCOTACAGTOTGGGC 

TTTGCTCAGCTCTVrTTCATTCACTGCTTCTCCTT 

gcc^gaccgctttgtggctatctgccaccccttgcactatg™ 

AGTCATTGGCAGGATTGGCCTGGTCTCTCTGGGTCGTAGTGTAGCACTCATT^^ 

30 CCACCaIga^St^^ 

30 p^^f^CTCTCTACAGTGGGTATA^ 

?CcTgCGCaS 

TGT^CCCACATCTGTGCnTGTGCTGCTCTT 

35 ctttcctcctg^gatgaatcccattgtctacagtgtgaagaccaaacagatccgggatcga 

GTGACGCATGCCTTTTGTTACTAA (SEQ ID NO: 336) 
AOLFR183 sequences: 

ASMIHRFWEHLSPWHMVMADrYLLLPPVLNPIVYSVKTKQI (SEQ ID NO: 337) 

45 atgacgaacttgaatgcatcacaggccaaccaccgtaacttcattctgacaggtatcccag 
cctggg^aaatggtaccatcctagctgtcatcaaggtg 

50 ototcatc^atgta™ 
agISgtggcca^ 

a^gactggaatatctgtcctcacccgggcagtctgtgtggtattccct^ 

atAAAGTGCCTACCCTTCTGCCATTCCAAT^ 

£^?g?gg^^ 

^ ^p^rACACTGGGGCTCGATGlTCTCCTCACT 
ATGAATGGAATGAATCACTCTCTGGTATCAGAATTTGTATTCATGGGACTCACCAACTCAC
GGGAGATTCAGCTTCTACITTTTGTTTTCTCITTGTTGTTCTACTTTGCGAGCATGATGGGA 
AACCTTGTCATTGTATTCACTGTAACCATGGATGCTCATCTGCACTCCCCCATGTAT1TCCT 
CCTGGCTAACCTCTCAATCATTGATATGGCATTTTGCTCAATTACAGCCCCTAAGATGATTT
GTGATATTTTCAAGAAGCACAAGGCCATCTCCrTTCGGGGATGTAlTACrCAGATC^OT 
TAGCCATGCTCITGGGGGCACTGAGATGGTGCTGCTCATAGCCATGGCCITTGACAGATAC 

ATGGCCATATGTAAACCTCTCCACTACCTGACCATCATGAGCCCAAGAATGTGTC^TACT 
i n ^TT^GCCACITCCTCrATCATTGGCCTTATCCACrCATTGGTC 
GATTTACCTTTITGTGGTCCTAATATCTITGA

CAGACTTGCCTGTACCAACACCCAAGAACTGGAGTTCATGGTCACTGTCAATAGTGGArTP 

ATTTCTGTGGGCTCCTTTGTCTTGCTGGTAATTTCCTACATCTTCATTCTGTTCACT^^ 

GAAACATTCTTCTGGTGGTCTAGCCAAGGCCCTCTCTACCCTGTCAGCTCATGTCACTGTG 

1 < ^T OTG ^ CmGGG CCACTGATGTTTTTCTACACATGGCCTrCTCCCACA^^^ 

GGATAAATATCTTGCTATTTTTGATGCATTTATTACTCCTTTTCTGAATCCAGTTATCTACA

CATTCAGGAACAAAGACATGAAAGTGGCAATGAGGAGACTGTGCAGTCGTCTTGCGCATT 
TTACAAAGATTTTGTAA (SEQ ID NO: 330) 

AOLFR180 sequences: 

MTNKMYAIYIKNLNYFSFLIVQCLQPTMAI^^

ALLGNSMIFLVUTKRRLHKPMYWLSMLAAVDLCLTITTLPT\^GVLWFHAREISFKACFIOMF 

FVHAFSLLESSVLVAMAFDRFVAICNPUWATILTDPJVT^LVIGLVICIRPAWLLPLLVA^ 

HGGHELSHPFCYHPE\OKYTYSKJ>WISSFWGLFLQLYLNGTD\OJ^FSYVLILRTVLGrVARKK 

QQKALSTCVCHICAVTIFY^LISLSI^HI^FHSTPRVLCSTLANIYLLLPPVLNPUYSLKTKTIR 
QAMFQLLQSKGSWGFNVRGLRGRWD (SEQ ID NO: 331)

ATGACTAATAAAATGTATGCTATATATATAAAGAATCTTAATTATTTTTCTTTCCTCATAGT 

TCAGTGTCTTCAACCAACCATGGCAATATTCAATAACACCACTTCGTCTTCCTCAAACTTCC 

, ft ^9I CACTGCA ^CCCTGGGCTGGAATGTGCTCATGTCTGGATCrCCATrCCAGTCTGCTGT 

CTCTACACCATTGCCCTCTTGGGAAACAGTATGATCTTTCnTGTCATCA-rTACTAAGCGGA

GACTCCACAAACCCATGTATTATTTCCTCTCCATGCTGGCAGCTGTTGATCTATGTCTGACC 

ATTACGACCCrTCCCACTGTGCTTGGTGTTCTCTGGTTTCATGCCCGGGAGATCAGCTTTAA 
AGCTTGCTTCATTCAAATGTTCTTTGTGCATGCTTTCTCCr^ 

TAGCCATGGCCTTTGACCGCTTCGTGGCTATCTGTAACCCACTGAACTATGCTACTATCCTC 
ACAGACAGGATGGTCCTGGTGATAGGGCTGGTCATCTGCATTAGACCAGCAGTTTTCTTAC 
TTCCCCTTCITGTAGCCATAAACACTGTGTCTTTTCATGGGGGTCACGAGCTTTCCCATCCA 
TTTTGCTACCACCCAGAAGTGATCAAATACACATATTCCAAACCTTGGATCAGCAGTTTTT 
GGGGACTGTTTOTCAGCTCTACCTGAATGGCACTGACGTATTGTTTATTCTI^CTCCTAT 
An G I C ™ ATCCTCCGTACTOTCT GGGCATTGTGGCCCGAAAGAAGCAACAAAAAGCT 
GCACTTGTGTCTGTCACATCTGTGCAGTCACTATTTTCTATGTGCCACTGATCAGCCT

TOGCACACCGCCTCTTCCACTCCACCCCAAGGGTGCTCTGTAGCACrrTGGCCAATA^A 
^™ OTACCACCTGTC ^ 

CAGGCTATGTTCCAGCTGCTCCAATCCAAGGGTTCATGGGGTTTTAATGTGAGGGGTCTTA 
GGGGAAGATGGGATTGA (SEQ ID NO: 332) 


AOLFR181 sequences: 

MSVLNNSEVKLFLLIGIPGLEHAHIWFSfflCIMYLLA 

AVSDMGLSLSSLPTMLRWLFNAMGISPNACFAQEFFIHGFIAnviESSVLLIMSLDRFLAJHNPLR 
YSSILTSNRVAJCMGLILAIRSILLVIPFPFTLRPJLKYCQKNLLSHSYCLHQDTMKLA(^ 
iU IYGFFIALCTMLDLAIJVI^YVLIIXTILSIASLAERLKALNTCV SHICAVLTFYVPIITLAAMHHF 
AKHKSPLVVILIADMFIXVPPLMNPIWCVK^ (SEQ ID NO: 333) 

ATGTCTGTTCTCAATAACTCCGAAGTCAAGCTTTTCCTTCTGA'TTGGGATCCCAGGACTGG 
AACATGCCCACATTTGGTTCTCCATCCCCATTTGCCTCATGTACCTGCTTGCCATCATGGGC 
AACTGCACCATTCTCTTTATTATAAAGACAGAGCCCTCGCTTCATGAGCCCATGTATTATTT
CCTTGCCATGTTGGCTGTCTCTGACATGGGCCTGTCCCTCTCCTCCCTTCXTACCATGTTGA 

AOLFR177 sequences:

MSFFFVDLRPM^SATHIVTEFILLGFPGCWmQIFLFSLFLVlYVLTLLGNGAIIYAWCOTLLH 
TPMWLLGNFAFLEIWYVSSTIPNMLVNILSKTKAISFSGCFLQFYFFFSLGTTECLFLAVMAYD 
RYLMCHPLOYPAIMTVRFCGKLVSFCWLIGFLGYPIPIFYISQLPFCGPNIIDHFLCDMDPLMAL

SCAPAPITECIFYTQ^ 

TVMVMYVSPTYGIPTLLQKILTLVYSVTTPLFWLIYTLRNKJ3MKLALRNVLFGMRIRQNS 
(SEQ ID NO: 325) 

ATGTCirrCTTCITrGTAGACTTAAGACCCATGAACAGGTCAGCAACACACATCGTGACAG
AG^ATTCTCCTGGGATTCCCTGGTTGCTGGAAGATTCAGATITrCCTCTTCTCATTGTTr 
TTGGTGATTTATGTCTTGACCnTGCTGGGAAATGGAGCCATCATCTATGCAGTGAGATGCA 
ACCCACTACTACACACCCCCATGTACTTTCTGGTGGGAAATTTTGCCTTCCTTGAGATCTGG 
TATGTGTCCTCCACTATTCCTAACATGCTAGTCAACATTCTCTCCAAGACCAAGG 

ATTTTCTGGGTGCITCCTCCAGTTCTATTTCT^

TTCTGGCAGTAATGGCrTATGATCGATACCTGGCCATCTGCCACCCACTGCAGTACCCTGC 

caSatgactgtaaggttctgtggtaagctggtgtct^^ 

q^YACCCAA^CCCATTTTCTACATCTCCCAACTCCCCTTCTGTGGTCCTAATATCATTGAT 
CACTTCCTGT^TGACATGGACCCATTGATGGCTCTATCCTGTGCCCCAGCTCCCATAA 

■20 aatgt^tttctatactcagagctcccttgtcctctttttcacta 
tcctatatcctg^actaacag 

rr^CTCTAC^CTGTGGTTCTCATTTGGTTGTGGTATCTCTTTTCTATGGGACAGTCATGGTA 
ATCTATGTAAGTCCTACATATGGGATCCCAACTTTATTGCAGAAGATCCTCACACTGGTA^T 

a^Jagtaao^ 

^GCTCTCAGAAATGTCCTGTTTGGAATGAGAATTCGTCAAAATTCGTGA (SEQ ID NO: 326)

AOLFR178 sequences: 

MVGANHSVVSEFWLGLTNSWEIRLLLLWSSMFYMASMMGNSLILLTVTSDPHLHSPI^LL 
ANLSFIDLGVSSWSPKMIYDLFRKHEVISFGGCIAQIFFIHVIG
LOYLTIMSPRMCMFFLVAAWW 

PSPSTHLDKFLAIFDAVLTPVLNPIIYTFRN (SEQ ID NO: 327) 

ATGGTTGGGGCAAATCACTCCGTGGTGTCAGAGTTTGTGTTCCTGGGACTCACCAA'TTCCT
GGGAGATCCGACTTCTCCTCCTTGTGrrCTCCTCCATGTm 

GTTAGCCAACCTCTCCTTCATTGACCTGGGTGTTTCCTCTGTCACrr^ 
ATGACCTGTTCAGAAAGCACGAAGTCATCTCCTTTGGAGGCTGCATCGCrCAAATC^ 
CATCCACGTCATTGGCGGTGTGGAGATGGTGCTGCTCATAGCCATGGCCTTTGACAGATAT
OTGGCCATATGTAAGCCCCTCCAGTACCTGACCATTATGAGCCCAAGAATGTGCATGTTCT 
?OTAGT^GCCTGGGTGACCGGCCnTATCCACTCTGTAG 

AAC^GCCCTTCTGTGGTCCTAATGTATCGGACAGCTTrTACTGTGACCTTCCTCGGTT 

caaacSgcctgcacagacagctaccgactggagttcatggttacagccaacagtggattc 

ATCTCTCTGGGCTCCTTCITC^TACT 

GA^ACACTCTTCAGCTGGTTTATCCAAGGCTCTGTCCACCCTTTCAGCTCACGTCAG 
GTAGTTTTGTTCTTTGGTCCTTTGATTTTTG 

GGATAAGTTTCTGGCCATCTTTGATGCAGTTCTCACTCCTGTTTTAAATCCTATCATCT 
CATTCAGGAATTGA (SEQ ID NO: 328) 

AOLFR179 sequences: 

MNGMNHSWSEFWMGLTNSREIQLLLFWSLLFYFASMMG 

anlIto^csitapkmicdifkkhkaisfrgcitqif^ 

E™WSG^SVGSF^ 

PSP^HLDKYLAIFD AF1TPFLNP VrYIERNKDMKVAMRRLCSRLAHFTKIL (SEQ ID NO: 329) 
TAATATTCCTCAACATGTCCATATCTTGCTGGCCAATCTTTATGTGGCAGTGCCACCAATGC
TGAACCCCATTGTCTATGGTGTGAAGACTAAGCAGATACGTGAGGGTGTAGCCCACCGGTT 
CTTTGACATCAAGACTTGGTGCTGTACCTCCCCTCTGGGCTCATGA (SEQ ID NO: 320) 

AOLFR175 sequences:

MHF^QhTOLNINLJPHLCLfflWSXOAGAFriHRHMKIFNSPSNSSTFrGFmLGFPCPREGOm^ 

LFTVWLLTLMGNGSIICAVHWDQRIJlAPMyiLLANFSFLEICYVTSTWSMI^ 

SGCFLQFyTTFSLGSTECFFLAVMAFDRYIAICRPLRYPTIMTPvRLCTNLVVNCWVLGF 

VNISQMSFCGSWIDHFLCDPAPLLTLTCKKGPVIELWSVl^PLPVFMLFLFIVGSYALVVRAVL 

RWSAAGPJlKAFSTCGSHLAWSIJYGSVLVMYGSPPSKNEAGKQKTVTLFySVVT^ 

YSLRNKDMRKALKKFWGT (SEQ ID NO: 321)

ATGCATTTTCTITCCCAAAATGATTTAAATATAAATCTGATTCCCCATCTATGTTTGGACCG 

TCATTCAGTAATTGCTGGTGCTTTTACAATTCACAGGCACATGAAAATCTTCAACAGCCCC 

AGCAACTCCAGCACCTTCACTGGCTTCATCCTCCTGGGCTTCCCTTGCCCCAGGGAGGGGC 

AGATCCTCCTCTTTGTGCTCTTCACTGTTGTTTACCTCCTGACCCTCATGGGCAATGGTTCC 

ATCATCTGTGCTGTGCACTGGGATCAGAGACTCCACGCCCCCATGTACATCCTGCTCGCCA 

ACTTCTCCTTCTTGGAGATATGTTATGTCACCTCCACAGTCCCCAGCATGCTGGCCAACTTC 

CTCTCTGACACCAAGATCATCTCGTTCTCTGGCTGCTTCCTCCAGTTCTACTTTTTCTTCTCC 

TTGGGCTCTACAGAATGCTTTTTCCTGGCAGTTATGGCATTTGATCGATACCTTGCCATCTG 

TCGGCCTCTACGCTATCCAACCATTATGACCAGACGTCTCTGTACCAATCTTGTGGTCAATT 

GCTGGGTACTTGGTTTCATCTGGTTCTTGATTCCTATCGTCAACATCTCCCAAATGTCCTTC 

TGTGGATCTAGGATTATTGACCACTTCCTATGTGACCCAGCTCCTCTTCTAACTCTCACTTG 

CAAAAAAGGCCCTGTGATAGAGCTTGTCTTTTCTGTCTTAAGTCCTCTGCCTGTCTrTATGC 

TCTTTCTCTTCATTGTGGGGTCCTATGCTCTGGTCGTGAGAGCTGTGTTGAGGGTCCCTTCA 

GCAGCTGGGAGAAGAAAGGCTrTCTCCACCTGTGGGTCTCACCTGGCTGTGGTTTCACTGT 

TCTACGGCTCAGTACTGGTCATGTATGGGAGCCCACCATCTAAGAATGAAGCTGGAAAGC 

AGAAGACTGTGACTCTGTTTTATrCTGTTGTTACCCCACTGCTTAACCCTGTGATATATAGT 

CTTAGGAACAAAGATATGAGAAAAGCTCTGAAGAAATTTTGGGGAACATAA (SEQ ID NO- 



AOLFR176 sequences: 

MFFIIHSLVTSVFLTALGPQNRTMHFVTEFVLLGFHGQREMQSCFFSFILVLYLLTLLGNGAIVC 
AVKLDRRLHTPMYILLGNFAFLEIWYISSTVPhlMLVh^ 

LSVMAYDRYIAICRPLHYPSIMTGK^CnLVCVCWVGGFLCYPVPrva.ISQLPFCGPNnDHLVCD 

PGPU^ALACISAPSTELICYTFNSMIIFGPFLSILGSYTLVIRAVLCIPSGAGRTKAFSTCGSHLMV 

VSLFYGTLMVMYVSPTSGNPAGMQKJITLVYTAMTPFLNPLrfSLRNKDMKDALKRV^ 
QN (SEQ ID NO: 323) 

ATGTTCTTTATTATTCATTCTTTGGTTACTTCTGTTTTTCTAACAGCm 

CAGAACAATGCATTTTGTGACTGAGTTTGTCCTCCTGGGTTTCCATGGTCAAAGGGAGATG 

CAGAGCTGCTTCTTCTCATTCATCCTGGTTCTCTATCTCCTGACACTGCTAGGGAATGGAGC 

TATTGTCTGTGCAGTGAAATTGGACAGGCGGCTCCACACACCCATGTACATCCTTCTGGGA 

AACTITGCCTTTCTAGAGATCTGGTACATTTCCTCCACTGTCCCAAACATGCTAGTCAATAT 

CCTCTCTGAGATTAAAACCATCTCCTTCTCTGGTTGCTTCCTGCAATTCTAl rTCTTTnTTC 

ACTGGGTACAACAGAGTGTTTCTTTTTATCAGTTATGGCTTATGATCGGTACCTGGCCATC 

TGTCGTCCATTACACTACCCCTCCATCATGACTGGGAAGTTCTGTATAATTCTGGTCTGTGT 

ATGCTGGGTAGGCGGATTTCTCTGCTATCCAGTCCCTATTGTTCTTATCTCCCAACTTCCCT 

TCTGTGGGCCCAACATCATTGACCACTTGGTGTGTGACCCAGGCCCATTGTTTGCACTGGC 

CTGCATCTCTGCTCCTTCCACTGAGCTTATCTGTTACACCTTCAACTCGATGATTATCTTTG 

GGCCCTTCCTCTCCATCTTGGGATCTTACACTCTGGTCATCAGAGCTGTGCTTTGTATTCCC 

TCTGGTGCTGGTCGAACTAAAGCTTTCTCCACATGTGGGTCCCACCTAATGGTGGTGTCTC 

TATTCTATGGAACCCTTATGGTGATGTATGTGAGCCCAACATCAGGGAACCCAGCAGGAAT 

GCAGAAGATCATCACTCTGGTATACACAGCAATGACTCCATTCTTAAATCCCCTTATCTAT 

AGTCTTCGAAACAAAGACATGAAAGATGCTCTAAAGAGAGTCCTGGGGTTAACAGTTAGC 
CAAAACTGA (SEQ ID NO: 324) 
CTTTCTGTACAGTCTACCAATCTGTGGCCCCAATGTTATTC

a^Sattg^tggaactgttg 

TCGTGG^^^GTATGGTCATCTTTACCITTCTGCT^ 

actttcccttt^ataaattaatgactgtgttttattca^ 

^t^atactcot^ 
aIotaagtatagttagaaaaagagtatctccca 

AGGCAACAAATAGGCGGTAA (SEQ ID NO: 316) 
AOLFR172 sequences:
AA^G^^VLV™ 

v^wgn7<^ 

AF^PGLFSYLAHRFGHHTVPKPVHILLSMYLLLPPALNPLIYGARTK 
(SEQ ID NO: 317) 



20 ^^^SS^^^^iS^™ 

ATAGC^C^AGGGTT^jCTC 



GCCTGGTCCAGATGTTCTTTGTACATGCACTGACTGCCATGGAATCAGGTGTGC^ 
CATCG^CTGTGATCGTGCTGCGGC^ 

AAArr^GTGTGGGTTATGCAGCClTGGCCCTGGCACTGAAAGCTGTGGCTA™ 
C^CC?ACTGCT^ 

TTG^GCACACATGGCAGTGGTAGAACTGGTGGTGGGTAACACACAGGCCACCAACTTATA 

SgtSgI^a^^^ 

SacSaSgcccatgctgtgctgcagctacctacccgggaggcccatc 

CTCGCACACOG^^TTGGTCATCACACT 

SaotSctgcSccaotgccctcaactc 

^QA^^CC^A^rcC^GGA^^CCTrCACATTCAGAAAAAGCCCGTTGTAA (SEQ ID NO: 318) 



AOLFR173 sequences: 

MSHTNVTIFHPAWVLPGIPGLEAYHIWLSIPLCLIY^ 
MTA^TTl^TOKALAff^ 

40 nIwygfswivmvildviliavsyslilraw 
tootgWqh^illa^^^ 

(SEQ ID NO: 319)

ATGAGTCACACCAATGTrACCATCTTCCATCCTGCAGTTTTTGTCCTTCCTGGCATCCCTGG 

45 S™gIggc^atcaca^ggctgtcaatac 

Tr^rAAACAGCAT^CTGATAGTGGTTATTGTCATGGAACGTAACC!TTCATGTGCCCATGTA 
TTTCTnrcCTCTCAATG^TGGCCGTCATGGACATCCT 

^CTTTGTCCATATGATGTTTGTGGGGGAGTCAGCTATCCT 

so S^ggcca™tgccccactgagatatacaa 

A^A^r^CTGGCCGTC^^ 

r A ArfVCCTGC^ClTCTGCCTAACCAAC 

g^gSc^ag^ 

TGTrATGGTCATC^GGATGTTATCCTCATCGCTGTGTCTTACTCACTGATCCT 
GGATTGCTCTTCCTGGGATGCAGACAGTATCTTT

TGGGACCATGGAGTTTGCATTACTTGGAGTGATGGCTGTGGACCGrTATGTGGCTGTGTGT 
AACCCTTTGAGGTACAACATCATTATGAACAGCAGTACCTGTATTTGGGTGGTAATAGTGT 
CATGGGTGTTTGGATTTCTTTCTGAAATCTGGCCCATCTATGCCACATTTCAGTTTACCTTC 
CGCAAATCAAATTCATTAGACCATTTTTACTGTGACCGAGGGCAATTGCTGAAACTG^^CT
GCGATAACACTCTTCTCACAGAGTTTATCCITTTCTTAATGGCrGTITITA 

TCTTTGATCCCTACGATrGTCTCCTACACCTACATTATCTCCACCATCCTCAAGATCCCGTC 
AGCCTCTGGCCGGAGGAAAGCCTTCTCCACTT1TGCCT 

TAAGATAGTTTCCCTGTTGGTTTCTGTGTTAACCCCCTTCCTGAATCCTTTCATCT 
AOLFR170 sequences:

MSFTSLIPSLCFSLTLPFIJFC^SLLPFI^AFLFITOWLLAFLSLFSVSVPVSSVSSSMVLCLYT.SV^ 

ASPSWOFSCMQGPILWIMANLSQPSEFVLLGFSSFGELQALLYGPFLMLYLLAFMG>nTnVM^[ 

ADTHLHTPMYFFLGNFSLLEILVTMTAVPRMLSDLLWHKVITFTGCMVQFYFHFSLGSTSFLIL 

TDMALDRFVAICHPLRYGTLMSP^CVQl^GAAWAAPFLAMVPTNaSP^^DYCHGDVlNH 

FFCDNEPLLQLSCSDTRLLEFWDFI^tALTFVLSSFLVTLISYGYIVTTVLPJPSASSCQ 

SgLC^qIsEQID^ 

ATGTCTTTCACTTCTCTCATACCCTCACTCTGTTTCTCCTrGACTCTCCCATTCCTGTTTTGT 
^TrcTTTCTTTATTGCCGTTTCTTTCTGCTITTCTGT^^ 

CTCTCTCTATTCT'CTGTCTCTGTCCCTGTTTCTTCTGTTTCAAGTTCAATGGTTCTCTGT 
I A T^ CTGmCTGOT ra^ 

TGGATCATGGCAAATCTGAGCCAGCCCrCCGAATTTGTCCTCTTGGGCrTCTCCTCCTTTGG 

^ CGATCATCATAG " ATGGTCATAGCT GACACCCACCTACATACACCCATGTACT^ 
CTGGGCAATTTTTCCCTGCTGGAGATCTTGGTAACCATGACTGCAGTGCCCAGGATGCTCT 

CACrTTTCCCTGGGGTCCACCTCCTTCCTCATCCTGACAGACATGGCCCTTGATCGCTTTGT 
GGCCATCTGCCACCCACTGCGCTATGGCACTCTGATGAGCCGGGCTATGTGTGTCCAGCTG 
? C J G ^ G ^ GCCTGGGCAGCTCCmCCTAGCCATGGT ^ 

iT^ A ™ CTGCCATGGCGACGTCATCAACCAOTC ^ CTGTG 

CAGTTGTCATGCTCTGACACTCGCCTGTTGGAATTCTGGGACTTTCTGATGGCCTTGACOT 

I2T^ CAGCTCC " CCTGGTGACCCTCATCTCCTA ^^ 

GGATCCCCTCTGCCAGCAGCTGCCAGAAGGCTTTCTCCACTTGCGGGTCTCACCTCACACT 

^J°TI™ CGGCTACAGTAGTACCATC ^ 

GTGCAAGTCAGGAAGGTCGTGGCCTTGGTGACTTCAGTTCTCACCCCCTTTCTCAATCCCT 

™ c i^ A i^^ 

GAAAGGCCTTTGCAAGGCACAATGA (SEQ ID NO: 314) 
AOLFR171 sequences: 

MVGNLLIWVTTIGSPSLGSLMYFFLAYLSLMDAT^ 

EHIXGGAEWIXVVMAYDRYVAISKJ'LHYLMNlNRLVCnXLWAMIGGFWSW 
ICGPlWU)HSVCDMYPLLELLCLDTYFIGLTWANGGnCMV^ 

KNLWCEKLSIVTUCRVSPTLNIFIPSSKATNRR (SEQ ID NO: 315) 

ATGGTGGGAAACCTCCTCATTTGGGTGACTACTATTGGCAGCCCCTCCTTGGGCTCCCTAA 
TGTACTTCTrCCTTGCCTACTTGTCACTTATGGATGCCATATATTCCACTGCCATGTCACCC 
AAATTGATGATAGACTTACTCTGTGATAAAATCGCTATTTCCTTGTCAGCTTGCATGGGTC 
AGCTCTTCATAGAACACTTACTTGGTGGTGCAGAGGTCTrrCCTTTTGGTGGTGATGGCCTA 
TGATCGCTATGTGGCTATCTCTAAGCCGCTGCACTATTTGAACATCATGAATCGACTGGTr 
TGCATCCTTCrGTTGGTGGTGGCCATGATTGGAGGTTTTGTGCACTCTGTGGTTCAAATTGT 






PCT/US01/20122 

WO 01/98526 

FVYLRPSSSYLLGM3KWSWYSLVIPMLNPLIYSLR>n<^IKDALWKVLERKKVFS (SEQ ID 
NO: 307) 

ATGTCCATAACCAAAGCCTGGAACAGCTCATCAGTGACCATGTTCATCCTCCTGGGA^CA 

5 cIgACCATOAGA^ 

CTGGCCTGGAACCTGGCCCTCA 

aI^aStcactgTcttcttctgggagcagaagaccatatca™ 
otacacgJ1?Sctgg^Sggcatatg^ 

CTCCATAmAGGC^CA 

Sccagtotg^^ 

SSctgaIccctctcatttacagtttgaggaacaaagagatcaaggatgccct^^ 

GTGTTGGAAAGGAAGAAAGTGTTTTCTTAG (SEQ ID NO: 308) 






AOLFR168 sequences:

^^s^aSSv^sysiilvslrkqsaegiu^ 

^SED^VaWtH™^ (SEQ ID NO: 309)



30 ^^^^^^^^"^^"^B 

AT^CTTTGGTTGCACT^AGATCnTCATCCTTACT 
IrrrmGTGGACCCAATGAGATAGATCACTACT^ 

40 caotcagcagaaggcaggcgcaaagccctctccacctgtc 

A^GA^GCTGTATTrrACACCATTATCACTC^^^^ 



GAGAAATGCAGAAGTAAAGAATGCAATGAAGAAACTGTGGGGCAGAAATG 
GGCTAAAGGGAAATAG (SEQ ID NO: 310) 



NIIMNS STCIWVVIVSW VFGFLSEIWPIY ATFOFTFRKSNSLDHt 1 Y UJKlAJLLh 
LFLMAWILIGSLIPTIYSYTYIISmKIPSASGRRKAFS^ASI^

G\^YNKWSLLVSVLTPFLNPFIFT1.RNDKVKEALRDGMKRCCQLLKD (SEQ 

Sa^^ 
^ a£™tca^ca^ 
AOLFR165 sequences:

MAVGRNNTIVTKFILLGLSDHPQMKIFLFMLFLGLyLLTLAWNLSLIALIKMDSHl^ 

S^SFU)ICYVSSTAPKMI^D]ITEQKTISFVGCATQYFWCGMGLTEO 

PIXYTVLJSHTLClJGVrVVGAYVGGFLSSFIETTSVYQHDFCGPYMINHFFC^ 

TSEWTFIVSWGIVSVL^ISYGYIVAAVVKJSSATGRTKAFSTC^SHLTAVTU^GSGFFM 
YMRPSSSYSIJ<W>KWSIFYALVIPVVNPII^^ 



ATGGCTGTAGGAAGGAACAACACAATTGTGACAAAATTCATTCTCCTGGGACTrTCAGArr 
ATCCTCAAATGAAGATTTTCCTTTTCATGTTATTTCTGGGGCTCTACCTCCTGACGTTGGCC 
TGGAACTTAAGCCTCATTGCCCTCATTAAGATGGACTCTCACCTGCACATGCCCATG'TACr 
TCITCCTCAGTAACCTGTCCITCCrGGACATCTGCTATGTGTCCrcCACCGCCCCTAAGATG 
CTGTCTGACATCATCACAGAGCAGAAAACCATTTCCTTTGTTGGCTGTGCCACTCAGTACT 
TTGTCTTCrGTGGGATGGGGCTGACTGAATGCTTTCTCCTGGCAGCTATGGCCTATGACCG 
GTATGCTGCAATCTGCAACCCCTTGCTTTACACAGTCCTCATATCCCATACACITrG^^ 
AGATGGTGGTTGGCGCCTATGTGGGTGGATTCCTTAGTTCTTTCATTGAAACATACTCTGT 
CTATCAGCATGATTTCTGTGGGCCCrATATGATCAACCACTTTTTCTGTGACCTCCCTCCAG 



TCCTGGCTCTGTCCTGCTCTGATACCTTCACCAGCGAGGTGGTGACCTTCATAGTCAGTGTT 

GTCGTTGGAATAGTGTCTGTGCTAGTGGTCCTCATCTCTTATGGTrACATTGTTGCTGCTCT 

TGTGAAGATCAGCTCAGCTACAGGTAGGACAAAGGCCTTCAGCACTTGTGCCTCTCACCTG 

ACTGCTGTGACCCrCTTCTATGGTTCTGGATTCTTCATGTACATGCGACCCAGTTCCAGCTA 

CTCCCTAAACAGGGACAAGGTGGTGTCCATATTCTATGCCTTGGTGATCCCCGTGGTGAAT 

CCCATCATCTACAGTTTTAGGAATAAGGAGATTAAAAATGCCATGAGGAAAGCCATGGAA 

AGGGACCCCGGGATTTCTCACGGTGGACCATTCATTTTTATGACCTTGGGCTAA (SEQ ID
NO: 304) 

AOLFR166 sequences: 

MEMENCTRVKEFIFLGLTQNREVSLVLFLFLLLVYVTTLLGNLLIMVTVTCESRLHTPMY^ 
NLSIADICFSSITWKVLVDLI^ERKTISFNHCFTQMFLFHLIGGVDWSI^VMALDRYVAISKPL 
HYATIMSRDHCIGLTVAAWLGGFVHSrVQISLLLPLPFCGPNVLDTFVCDVHRVIXIAHTO 
ELLMlSNNGLLTTLWFFLLLVSYIVILSLPKSQAGEGPvPJCAJSTCTSHITWTLHFWCn^ARP 
FTALPMDKAIS VTFTVISP LLNPLIYTLPJ^fflEMKS AMRRLKRRL.VPSDRK (SEQ ID NO: 305)

ATGGAGATGGAAAACTGCACCAGGGTAAAAGAATTTATTTTCCTTGGCCTGACCCAGAATr 
GGGAAGTGAGCTTAGTCTTATTTCTTTTC^ 

AACCTCCTCATCATGGTCACTGTTACCTGTGAATCTCGCCTTCACACGCCCATGTATTTTTT 
GCTCCATAATTrATCTATTGCCGATATCTGCTTCTCTTCCATCACAGTGCCCAAGGTTCTGG 
TGGACCTTCTGTCTGAAAGAAAGACCATCTCCTTCAATCATTGCTTCACTCAGATGTTTCTA 
TTCCACCTTATTGGAGGGGTGGATGTATTTTCTCITTCGGTGATGGCATTGGATCGATATG 

CACAGTGGCTGCCTGGTTGGGGGGCTTTGTCCACTCCATCGTGCAGATTTCCCTGTTGCTC 

CCACTCCCTTTCTGCGGACCCAATGTTCTTGACACTTTCTACTGTGATGTCCACCGGGTCCT 

CAAACTGGCCCATACAGACATTTTCATACnTGAACTACTAATGATTTCCAACAATGGACTG 

CTCACCACACTGTGGTTTTTCCTGCTCCTGGTGTCCTACATAGTCATATTATCATTACCCAA 

GTCTCAGGCAGGAGAGGGCAGGAGGAAAGCCATCTCCACCTGCACCTCCCACATCACTGT 

^J?^ CCCTGCAmCGTGCCCTGCATCTATGTCTATGCCCG ^ 

TGGATAAGGCCATCTCTGTCACCTTCACTGTCATCTCCCCTCTGCTCAACCCCTTGATCTAC 

ACTCTGAGGAACCATGAGATGAAGTCAGCCATGAGGAGACTGAAGAGAAGACTTGTGCCT 
TCTGATAGAAAATAG (SEQ ID NO: 306)

AOLFR167 sequences: 

MSITKAWNSSSVTMFILLGFTDHPELQAIXFVTFLGI^ 

LSNl^FTOICYSSAVAPNMLTOFFWEQKTISFVGCAAQFFFFVGMGLSECLLLTAMAYDRYAAJ 
SSPLLYPTIMTQGLCTRMWGAYVGGFl^SLIQASSIFRLHFCGPhniNHFFCDLPPVLALSCSDT 
FI^QVVNFLVVVTVGGTSFLQLLISYGYrVSAVI.KTPSAEGRWKACNTCASHL 
GTGTATCTTCATTGCCGTCTTGAAGATGCCnTCGTTAGAGGGCAGGCACAAAGCCTTCTCC

ACCTGTGCCTCTTACCTCATGGCTGTCACCATATTCm 
GCGCCCTACATCTAGCTACTCAATGGAGCAAGACAAGGTTGTCT^ 
ATAATrcCTGTGCTAAATCCCCTCATCTATAGTTTAAAAAATAAGGAT^ 
TAAAGAAGATCTTATGGAAACACATCTTGTAG (SEQ ID NO: 298)

AOLFR163 sequences: 

SYSFDMDKIVSTFYTWFPMLNLM1YSLRNKDVKEALKKLLP (SEQ ID NO: 299) 
ATGCAGAGGAGCAATCATACAGTGACTGAGTTTATACTGCTGGGCTTCACCACAGACCCA 

15 ggaItgcagc^ 

ATAGCACOCTCATCGTGTTGATCT 

actggaaatctgtcgtttctggatctctggtat^ 

GAOCTGCATCTCTGAAGACAAAAGCATCTCCITrGCTGGCT 

20 g^ccatctccaXgcccctgctttatc 

GTAGCAGTCTCATATTGTGGTGGCTTTATTAACTCTTCAATCATCACCAAGAAAACGTTTrc 

aS^^gtggcgagaagggcggct 

ggat^tcctcctccaagggctacctcaaagccttctccacatgctcct 
gtcacttStactatggctccattctctacatctacgctctccccaga 

TGATATOGACAAAATAGTTTCnrACATTTTACACT 

1?ctaca^c^ 

(SEQ ID NO: 300)

AOLFR164 sequences:

mflternttseatftllgfsdylelqiplffwlavygfsw 

(SEQ ID NO: 301)

ATGTrTCTGACAGAGAGAAATACGACATCTGAGGCCACATTCACTCTCnTGGGCTTCTCAG 

SSgg^totg^ 

AT^T^OTCAACCACCTCTCCITTGT 
CTTTTrc^nTGCACCTTTGTAGTGACTGAATTAAT^ 

45 acSgtggc^a^gcaatcct 

• ATGCTGOT'GGTTGTATTGTATGCATGGGGAGTCGCATGTTCCCTGACACTCGCGTGCT 

CTTTAAAGTrATCTTTTCATGGTTTCAAC^ 

CTGATATCACTCTCTTACCCTGACT^ 

tPttaATGAGATAA^ 

50 otgaIg1?gc^ 

GACTGCCATCACCATCTTCCATGGCACCATCCTCTTCCTCTACTGTGTACCCAACTC 
ACTCC^GGCACACAGTCAAAGTGGCCTCTGTGm 

A (SEQ ID NO: 302)
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TTGTCTTGTTTTITGCrGGCTC 

GTGGCCATTTGTCACCCCCTTAGATACACAGTCATCATGAACCCCCGCCTCTGTGGCCTGC 

TGATTCTTCTCTCTCTGTTGACTAGTGTTGTGAATGCCCTTCTTCTCAGCGTGATGGTGTTG 

AGGCTGTCCTTCTGCACAGACCTGGAAATCCCGCTCTTCTTCTGTGAACTGGCTCAGGTCA 

TCCAACTCACCTGTTCAGACACCCTCATCAATAACATCCTGATATATTTTGCAGCTTGCATA 

TTTGGTGGTGTTCCTCTGTCTGGAATCATTTTGTCTTACACTCAGATCACCTCCTGTGTTTT 

GAGAATGCCATCAGCAAGTGGAAAGCACAAAGCAGTTTCCACCTGTGGGTCTCACCTCTCC 

ATTGTTCTCTTGTTCTATGGGGCAGGTTTGGGGGTGTACATTAGTTCTGTGGTTACTGACTC 

ACCTAGGAAGACTGCAGTGGCTTCAGTGATGTATTCTGTGTTCCCTCAAATGGTGAACCCC 

TTTATCTATAGTCTGAGGAATAAGGACATGAAAGGAACCTTGAGGAAGTTCATAGGGAGO 

ATACCITCTCTTCTGTGGTGTGCCATTTGCTTTGGATTCAGGTTTCTAGAGTAA (SEQ ID 
NO: 294) 

AOLFR161 sequences: 

MEPRNQTSASQFILLGLSEKPEQETLLFSLFFCMYLVMWGNLLIILAISIDSHLHTPMYFFLANL 
SLVDFCLATNTIPKMLVSLQTGSKAISYPCCLIQMYFFHFFGIVDSVIIAMMAYDRFVAICHPIJH 
YAIOMSLRLCRLLVGALWAFSCFISLTHILLMARLWCGSHEWHYFCDLTPILRLSCTDTSVNR 
IFILIVAGMVlATPFVCIlASYARILVAIMKVPSAGGRKKAFSTCSSHLSWAlJ^GTnGVYLCP 
SSVLTTVKEKASAVMYTA\OTMLNPFIYSLRJnIRDLKGALRKLVNRKITSSS (SEQ ID NO: 295) 

ATGGAACCAAGAAACCAAACCAGTGCATCTCAATTCATCCTCCTGGGACTCTCAGAAAAGC 

CAGAGCAGGAGACGCITCTCTTTTCCCTGTTCTTCTGCATGTACCTGGTCATGGTCGTGGG 

GAACCTGCTCATCATCCTGGCCATCAGCATAGACTCCCACCTCCACACCCCCATGTACTTCT 

TCCTGGCCAACCTGTCCCTGGTTGATTTCTGTCTGGCCACCAACACCATCCCTAAGATGCT 

GGTGAGCCTTCAAACCGGGAGCAAGGCCATCTCTTATCCCTGCTGCCTGATCCAGATGTAC 

TTCITCCATTTCTTTGGCATCGTGGACAGCGTCATAATCGCCATGATGGCTTATGACCGGTT 

CGTGGCCATCTGCCACCCATTGCACTACGCCAAGATCATGAGCCTACGCCTCTGTCGCCTG 

CTGGTCGGCGCCCTCTGGGCGTTTTCCTGCTTCATCTCACTCACTCACATCCTCCTGATGGC

CCGTCTCGTTTTCTGCGGCAGCCATGAGGTGCCTCACTACTTCTGCGACCTCACTCCCATCC 

TCCGACTTTCGTGCACGGACACCTCTGTGAATAGGATCTTCATCCTCATTGTGGCAGGGAT 

GGTGATAGCCACGCCCTTTGTCTGCATCCTGGCCTCCTATGCTCGCATCCTTGTGGCCATCA 

TGAAGGTCCCCTCTGCAGGCGGCAGGAAGAAAGCCTTCTCCACCTGCAGCTCCCACCTGTC 

TGTGGTTGCTCTCTTCTATGGGACCACCATTGGCGTCTATCTGTGTCCCTCCTCGGTCCTCA 

CCACTGTGAAGGAGAAAGCTTCTGCGGTGATGTACACAGCAGTCACCCCCATGCTGAATCC 

CTTCATCTACAGCTTGAGGAACAGAGACCTGAAAGGGGCTCTCAGGAAGCTGGTCAACAG 

AAAGATCACCTCATCTTCCTGA (SEQ ID NO: 296) 

AOLFR162 sequences: 
MMRLMKEVRGRNQTWTE^ 

TPMYFFI^Sl^FVDASYSSSVTPKMLVNLMAENKAISFHGCAAQFYFFGSI^GTE 
YDRYAArVVNPLLYPVLVSGRICFLLIATSFLAGCGNAAIHTGMTFPJ^FCGSNFJNHFYCDTPP 
LKI^CSDTHFNGIVIMAFSSFIVISCVMIVLISYLCIFIAVLKMPSLEGRHKAFSTCASY1>MA 
FGTILFMYT.RCTSSYSMEQDKVVS W YTVIff VIJ^IPLIYSLK (SEQ ID NO: 297)

ATGATGAGACTTATGAAAGAGGTTCGAGGCAGAAATCAAACAGAAGTAACAGAATTTCTC 
CTCTTAGGACTTTCCGACAATCCAGATCTACAAGGAGTCCTCTTTGCATTGTTTCTGTTGAT

CTATATGGCAAACATGGTGGGCAATTTGGGGATGATTGTATTGATTAAGA'rTGATCTCTGT 

CTCCACACCCCCATGTATTTCnrTTCTCAGTAGCCTCTCTTTTGTAGATGCCTCTTACTCTTCT 

TCCGTCACTCCCAAGATGCTGGTGAACCTCATGGCTGAGAATAAGGCCATTTCTTTTCATG 

GATGTGCTGCCCAGTTCTACTTCTrTGGCTCCTTCCTGGGGACTGAGTGCTTCCTGTTGGCC 

ATGATGGCATATGACCGCTATGCAGCCATTTGGAACCCCCTGCTCTACCCAGTTCTCGTGT

CTGGGAGAATTTGCTTTTTGCTAATAGCTACCTCCTTCTTAGCAGGTTGTGGAAATGCAGC

CATACATACAGGGATGACTTTTAGGTTGTCCrTTTGTGGTTCTAATAGGATCAACCATTTCT 

ACTGTGACACCCCGCCACTGCTCAAACTCTCTTGCTCTGATACCCACTTCAATGGCATTGTG 

ATCATGGCATTCTCAAGTTTTATTGTCATCAGCTGTGTTATGATTGTCCTCATTTCCTACCT 
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ATGAAAGCAGGAAACTTCTCAGACACTCCAGAATTCTTTCTCTTGGGATTGTCAGGGGATC 
CGGAGCTGCAGCCCATCCTCTTCATGCTGTTCCTGTCCATGTACCTGGCCACAATGCTGGG 
GAACCTGCTCATCATCCTGGCCGTCAACTCTGACTCCCACCTCCACACCCCCATGTACTTCC 
TCCTCTCTATCCTGTCCTTGGTCGACATCTGTTTCACCTCCACCACGATGCCCAAGATGCTG
GTGAACATCCAGGCACAGGCTCAATCCATCAATTACACAGGCTGCCTCACCCAAATCTGCT 
TTGTCCTGGTTTTTGTTGGATTGGAAAATGGAATTCTGGTCATGATGGCCTATGATCGATT 
TGTGGCCATCTGTCACCCACTGAGGTACAATGTCATCATGAACCCCAAACTCTGTGGGCTG 
CTGCTTCTGCrGTCCTTCATCGTTAGTGTCCTGGATGCTCTGCTGCACACGTTGATGGTGCT 
ACAGCTGACCTTCTGCATAGACCTGGAAATTCCCCACTTTTTCTGTGAACTAGCTCATATTC
TCAAGCTCGCCTGTTCTGATGTCCTCATCAATAACATCCTGGTGTATTTGGTGACCAGCCT 
GTTAGGTGTTGTTCCTCTCTCTGGGATCATTTTCTCTTACACACGAATTGTCTCCTCTGTCA 
TGAAAATTCCATCAGCTGGTGGAAAGTATAAAGCnTITTCCATCTGCGGGTCACATTTAAT 

cgttgtttccttgttttatggaacAgggtttggggtgtaccttagttctggggctacccact 

CCTCCAGGAAGGGTGCAATAGCATCAGTGATGTATACCGTGGTCACCCCCATGCTGAACCC
ACTCATTTACAGCCTGAGAAACAAGGACATGTTGAAGGCTTTGAGGAAACTAATATCTAG 

GATACCATCTTTCCATTGA (SEQ ID NO: 290) 
AOLFR159 sequences: 

MGPRNQTAVSEFLLMKVTEDPELKLIPFSLFLSMYLVTILGNLLILLAVISDSHLHTPMYFLLFN
LSFTDI CLTTTTWKIL\^QAQNQSITYTGCLTQICLVLWAGLESCFLAVMAYDRYVA^ 
RYTVLMNVHFWGLLILLSMFMSTMDALVQSLM^^ 

INNILIYFASSVFGAIPLSGUFSYSQIVTSVLRMPSARGKYKAFSTCGCHLSVFSLFYGTAFGVYIS 
SAVAESSRITAVASVMYTVVPQMMNPFIYSLRNKEMKKALRKLIGRLFPF (SEQ ID NO: 291)

ATGGGACCCAGAAACCAAACAGCTGTTTCAGAATTTCTTCTCATGAAAGTGACAGAGGAC 
CCAGAACTGAAGTTAATCCCTTTCAGCCTGTTCCTGTCCATGTACCTGGTCACCATCCTGG 
GGAACCTGCTCATTCTCCTGGCTGTCATCTCTGACTCCCACCTCCACACCCCCATGTACTTC 
CTTCTCTrTAATCTCTCCTTTACTGACATCTGTTTAACCACAACCACAGTCCCAAAGATCCT 
AGTGAACATCCAAGCTCAGAATCAGAGTATCACTTACACAGGCTGCCTCACCCAGATCTGT 
CTTGTCTTGGTTTrrGCTGGCTTGGAAAGTTGCnTrCTTGCAGTCATGGCCTACGACCGCTA 
TGTGGCCATTTGCCACCCACTGAGGTACACAGTCCTCATGAATGTCCATTTCTGGGGCTTG 
CTGATTCTTCTCTCCATGTTCATGAGCACTATGGATGCCCTGGTTCAGAGTCTGATGGTATT 
GCAGCTGTCCTTCTGCAAAAACGTTGAAATCCCTTTGTTCTTCTGTGAAGTCGTTCAGGTC 
ATCAAGCTCGCCTGTTCTGACACCCTCATCAACAACATCCTCATATATTTTGCAAGTAGTGT 
ATTrGGTGCAATTCCTCTCTCTGGAATAATTTTCTCTTATTCTCAAATAGTCACCTCTGTTC 
TGAGAATGCCATCAGCAAGAGGAAAGTATAAAGCGTTTTCCACCTGTGGCTGTCACCTCTC 
TGTTTTTTCCTTGTTCTATGGGACAGCTTTTGGGGTGTACATTAGTTCTGCTGTTGCTGAGT 
CTTCCCGAATTACTGCTGTGGCTTCAGTGATGTACACTGTGGTCCCTCAAATGATGAACCC 
CTTCATCTACAGCCTGAGAAATAAGGAGATGAAGAAAGCTTTGAGGAAACTTATTGGTAG

GCTGTTTCCTTTTTAG (SEQ ID NO: 292)



AOLFR160 sequences: 

MPMQLLLTDFIIFSIRFIINSMEARNQTAISKFLLLGLIEDPELQPVLFSLFLSMYLVTILGNLLILL

45 aVISdWhTPMYFFLSNLSFLDICLSTTTIPKMLVN^ 
LLAAMAYDRYVAICHPLRYTVIMNPRLCGLLI^^ 

CELAOVIQLTCSDTLINNILIYFAACIFGGWLSGIIL^YTQITSCVLRMPSASGKHKAVSTCGSHL 
SrVXLFYGAGLGWISSWTDSPRKTAVASVMYSWPQMVNPFr^SLRNKDMKGTIJ^ 
SLLWCAICFGFRFLE (SEQ ID NO: 293) 

ATGCCGATGCAGCTGCTGCTTACAGATTTTATTATCTTTTCCATCAGATTCATCATCAACAG 
CATGGAAGCGAGAAACCAAACAGCTATTTCAAAATTCCTTCTCCTGGGACTGATAGAGGAT 
CCGGAACTGCAGCCCGTCCTrTTCAGCCTGTTCCTGTCCATGTACTTGGTCACCATCCTGGG 
GAACCTGCTCATCCTCTTGGCTGTCATCTCTGACTCTCACCTCCACACCCCCATGTACTTCT 
TCCTCTCCAATCTCTCCTTTTTGGACATTTGTTTAAGCACAACCACGATCCCAAAGATGCTG
GTGAACATCCAAGCTCAGAATCGGAGCATCACGTACTCAGGCTGCCTCACCCAGATCTGCT 

WO 01/98526 



PCT/US01/20122 



AOLFR156 sequences: 

MCWAMPSPFTGSSTRNMESRNQSTVTEFITO 

. DTHLGNPMYKTISIFSFLEIWYTTATIPKMLSNUSEKKAISMTGC^QMYFFHSLENSEGILLTT 
MAIDRYVAICNPLRYQMIMTPPXCAHLSAGSCLFGFLILLPEIVMISTLPFCGPNOIHOIFCDLVP 
5 VLSLACTDTSMILIEDVlHAVTmTFLIIAl^YVPJVTVI^ 

ATGTGCTGGGCTATGCCCrCTCCATTTACAGGTAGCTCTACTAGAAATATGGAGAGCAGAA 

ACCAATCAACAGTGACTGAATTTATCTTCACTGGATTCCCTCAGCTTCAGGATGGTAGTCT

CCTGTACTTCTTTCCTTTACTTTTCATCTATACTTTTATTATCATTGATAAOT 

CTCTGCTGTAAGGCTGGACACCCATCTGGGCAACCCCATGTATAATTTTATCAGTATATTTT 

CCTTTCTGGAGATCTGGTACACCACAGCCACCATTCCCAAGATGCTCTCCAACCTCATCAG 

TGAAAAGAAGGCCATCTCAATGACTGGCTGCATCTTGCAGATGTATTTCTTCCACTCACTT 

GAAAACTCAGAGGGGATCTTGCTGACCACCATGGCCATTGACAGATACGTTGCCATCTGCA 

ACCCTCTTCGCTATCAAATGATCATGACCCCCCGGCTCTGTGCTCACCTCTCTGCAGGTTCC 

TGCCTCTTCGGTTTCCTTATCCTGCTTCCCGAGATTGTGATGATTTCCACACTGCCTTTCTG 

TGGGCCCAACCAAATCCATCAGATCTTCTGTGACTTGGTCCCTGTGCTAAGCCTGGCCTGT 

ACAGACACGTCCATGATTCTGATTGAGGATGTGATTCATGCTGTGACCATCATCATTACCT 

TCCTAATCATTGCCCTGTCCTATGTAAGAATTGTCACTGTGATATTGAGGATTCCCTCTTCT 

GAAGGGAGGCAAAAGGCTNTTTCTACCTGTGCAGGCCACCTCATGGTCTTCCTGATATTCT 

TTGGCAGTGTATCACTCATGTACITGCGTTTCAGCAACACTrATCCACCAGTTTTGGACAC 

AGCCATTGCACTGATGTTTACTGTACTTGCTCCATTCTTCAATCCCATCATTTATAGCCTGA 

GAAACAAGGACATGAACAATGCAATTAAAAAACTGTTCTGTCTTCAAAAAGTGTTGAACA 
AGCCTGGAGGTTAA (SEQ ID NO: 286)

AOLFR157 sequences: 

MAMD>A^TAVFQFLLIGIShT^QWRDTFFTLVLIIYI^TLLGNGFMIFLIOT 

SFLDLCYGTASMPQALVHCFSTHPYl^YPRClAQTSVSlALATAECIXI^AMAyDRVVAISNP 

LRYSVVMNGPVCVCLVATSWGTSLVLTAMLILSLRlJffCGANVINHFACEILSLIKLTCSDTSL 
NEFMILITSIFTLLLPFGFVLI^YIRIAMAIIP^^ 

QSKSSPDQDKFISVFYGALTPMLNPLHfSLRKKDVKRAIRKVMLKRT (SEQ ID NO: 287)



ATGGCCATGGACAATGTCACAGCAGTGTTTCAGTTTCTCCTTATTGGCATTTCTAACTATCC 

TCAATGGAGAGACACGTTTTTCACATTAGTGCTGATAATTTACCTCAGCACATTGTTGGGG

AATGGATTTATGATCTTTCTTATTCACTTTGACCCCAACCTCCACACTCCAATCTACTTCT^ 

CCTTAGTAACCTGTCTTTCTTAGACCTTTGTTATGGAACAGCnTCCATGCCCCAGGCnTTGG 

TGCATTGTTTCTCTACCCATCCCTACCTCrCTTATCCCCGATGTTTGGCTCAAACGAGTGTC 

TCCTTGGCTTTGGCCACAGCAGAGTGCCTCCTACTGGCTGCCATGGCCTATGACCGTGTGG 

TTGCTATCAGCAATCCCCTGCGTTATTCAGTGGTTATGAATGGCCCAGTGTGTGTCTGCTT

GGTTGCTACCTCATGGGGGACATCACTTGTGCTCACTGCCATGCTCATCCTATCCCTGAGG 

CTTCACTTCTGTGGGGCTAATGTCATCAACCATTTTGCCTGTGAGATTCTCTCCCTCATTAA

GCTGACCTGTTCTGATACCAGCCTCAATGAATTTATGATCCTCATCACCAGTATCTTCACCC 

TGCTGCTACCATTTGGGTTTGTTCTCCTCTCCTACATACGAATTGCTATGGCTATCATAAGG 

ATTCGCTCACTCCAGGGCAGGCTCAAGGCCTTTACCACATGTGGCTCTCACCTGACCGTGG

TGACAATCTTCTATGGGTCAGCCATCTCCATGTATATGAAAACTCAGTCCAAGTCCTCCCC 

TGACCAGGACAAGTTTATCTCAGTGTTTTATGGAGCTTTGACACCCATGTTGAACCCCCTG 

ATATATAGCCTGAGAAAAAAAGATGTTAAACGGGCAATAAGGAAAGTTATGTTGAAAAGG 
ACATGA (SEQ ID NO: 288) 



AOLFR158 sequences: 

MKAGNFSDTPEFFLLGLSGDPELQPILFMLFI^MYLATML^ 

LSLVDICFTSTTMPKMLVNIQAQAQSINYTGCLTQICFVLVFVGLF^GILVMMAYDRF^AJCHP 

LRYNVIMNPKLCGLLLLLSFTVSVI1)ALI^TIJV1VLQLTFCIDLEIPHF^ 

NILVYLVTSLLGVWLSGnFSYTRIVSSVMK^ 

GATHSSRKGAIASVMYTVVTPMLNPLrYSLRNKDMlXAI^KLISRIPSFH (SEQ ID NO: 289)
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ATAGTCACTTTGTITTATGGAACCCTCTTCTGCATGTACGTAAGGCCTCCATCAGAGAAGT 
CTGTAGAGGAGTCCAAAATAACTGCAGTCTTTTATACTTTTTTGAGCCCAATGCTGAACCC 
ATrGATCTATAGCCTACGGAACACAGATGTAATCCTTGCCATGCAACAAATGATTAGGGGA 
AAATCCTTTCATAAAATTGCAGTTTAG (SEQ ID NO: 280) 

AOLFR152 sequences: 

MDOINHThfVK^FFFLELTRSRELEFFLFVWFAVYVATVLGNALIVVTITCESRLHTPMW^ 
KSVEDIVFSSITWKFLVDLLSDRKTISYNDCMAQ1FFFHFAGGADIFFLSVMAYDRYLAIAKPL 
HYVTMMRKEVWVALWASWVSGGLHSIIQVILMLPFPFCGPhm.DAFYCYVLQVVKLACTDT 
FALELFMISNNGLVTLLWLLLLGSYTVILVMLRSHSGEGRNKALSTCTSHMLVVTLHFVPCV
YlYCRPFMTLPMDTTISrNNTVITPMLNPIIYSLimQEMKSAMQPXQPJILGPSESRKWG (SEQ ID 

NO: 281) 
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ATGGACCAGATCAACCACACTAATGTGAAGGAGTTTTTCTTCCTGGAACTTACACGTTCCC 
GAGAGCTGGAGTTTTTCTTGTTTGTGGTCTTCTTTGCTGTGTATGTAGCAACAGTCCTGGG 
AAATGCACTCATTGTGGTCACTATTACCTGTGAGTCCCGCCTACACACTCCTATGTACTTTC 
TCCTGCGGAACAAATCAGTCCTGGACATCGTTTTTTCATCTATCACCGTCCCCAAGTTCCTG 
GTGGATCTTTTATCAGACAGGAAAACCATCTCCTACAATGACTGCATGGCACAGATCTTTT 
__ „ . ^ a rp.Tv^^poT^rTr.TrjATRfirrrATfiACAfiATAC 



AOLFR153 sequences: 

MSKTSLVTAFILTGLPHAPGLDAPLFGIFLVVYVLTVLGNLLILLVIRVDSHLHTPMYYFETNLS 
FIDMWFSTVTVPKMLMTLVSPSGRAISFHSCVAQLYFFHFLGSTECFLYTVMSYDRYLAISYPL 
RYTSMMSGSRCALIATSTWI^GSLHSAVQTILTFHLPYCGPNQIQHYLCDAPPILKLACADTSA 

>OjMVIFVDIGLVASGCFLLIVLSYVSrVCSILRIH^ 

PGSPJDVVDGWAIFYTVLTPLLNPVVYTLRNKEVKKAVLKLRDKVAHSQGE (SEQ ID NO: 
283) 



ATGTCCAAGACCAGCCTCGTGACAGCGTTCATCCTCACGGGCCTTCCCCATGCCCCAGGGC 

40 TGGACGCCCCACTCTTTGGAATCTTC^ 

CCTCATCCTGCTGGTGATCAGGGTGGATTCTCACCTCCACACCCCCATGTACTACTTCCTCA 

CCAACCTGTCCTTCATTGACATGTGGTTCTCCACTGTC^ 

CTTGGTGTCCCCAAGCGGCAGGGCTATCTCCTTCCACAGCTGCGTGGCTCAGCTCTATTrTT 
TCCACTTCCTGGGGAGCACCGAGTGTTTCCTCTACACAGTCATGTCCTATGATCGCTACTrG 
GCCATCAGTTACCCGCTCAGGTACACCAGCATGATGAGTGGGAGCAGATGTGCCCTCCTGG
CCACCAGCACnTGGCTCAGTGGCTCTCTGCACT 

TTGCCCTACTGTGGACCCAACCAGATCCAGCACTATTTGTGTGATGCACCGCCCATCCTGA 
AACTGGCCTGTGCAGACACCTCAGCCAACGAGATGGTCATCTTTGTGGACATTGGGCTAGT 

GGCCTCGGGCTGCTTTCTCCTGATAGTGCTGTCTTATGTC 
GGATCCACACCTCAGAGGGGAGGCACAGAGCCTTTCAGACCTGTGCCTCCCACTGCATCGT
GGTCCTTTGCTTTTTTGTNNCCTGTGTTTTCATTTACCTGAGACCAGGCTCCAGGG 
TGGATGGAGTTGTGGCCATTTTCTACACTGTGCTGACACCCCTTCTCAACCCTGTTGTGTAC 
ACCCTGAGAAACAAGGAGGTGAAGAAAGCTGTGTTGAAACTGAGAGACAAAGTAGCACAT 

TCTCAGGGAGAATAA (SEQ ID NO: 284) 

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^GGTGTTCCCAAGTGGCAGGGCTATCTCCITCCACAGCTGCATGGCTCAGCTCTATITCr 

TTCACTTCCTAGGGGGCACCGAGTGTTTCCTCTACAGGGTCATGTCCTGTGATCGCTACCT 

GGCCATCAGTTACCCGCTCAGGTACACCAGCATGATGACTGGGCGCTCGTGTACTCTTCTG 

GCCACCAGCACITGGCTCAGTGGCTCTCTGCACTCTGCTGTCCAGGCCATATTGACTTTCC 

ATTTGCCCTACTGTGGACCCAACTGGATCCAGCACTATTTGTGTGATGCACCGCCCATCCT 

GAAACTGGCCTGTGCAGACACCTCAGCCATAGAGACTGTCATTTTTGTGACTGTTGGAATA 

GTGGCCTCGGGCTGCTTTGTCCTGATAGTGCTGTCCTATGTGTCCATCGTCTGTTCCAT^CT 

GCGGATCCGCACCTCAGAGGGGAAGCACAGAGCCTTTCAGACCTGTGCCTCCCACTGTATr 

GTGGTCCTTrGCTrCTTTGGCCCTGGTCTTTrCATITACCTGAGGCCAG 

TGTGGATGGAGTTGTGGCCGTTITCTACACTGTGCTGACGCCCCTrCTCAACCCTOT 

ACACCCTGAGGAACAAGGAGGTGAAGAAAGCTCTGTTGAAGCTGAAAGACAAAGTAGCAC 
ATTCTCAGAGCAAATAG (SEQ ID NO: 276)

AOLFR150 sequences: 

MELGN\TTRVKEFIFLGLTQSQDQSLVLFLFLCLVYMTTLLG 

NLAILDICFSSTTAPK\^LDLLSKKKTISYTSCMTQIFLFHLLGGADIFSI^VMAEDCYMAISKPL 
HYVTIMSRGQCTAUSASWMGGFVHSIVQlSLLlJ > IJ > FCGPNNnLDTFYCDW 
LEFLMISNNGL\OTLWIFIXVSYT\aiMTlJlSQAGGGRRKAISTCTSPHHCGDPAFCALHLCI.r 
PALHCPPHRKGHLCHLHCHLPSAEPFDLHSEEPGNEVSHEKTEEKTRAF (SEQ ID NO: 277) 

ATGGAGTTGGGAAATGTCACCAGAGTAAAAGAATTTATATTrCTGGGACTTACTCAATCCC 

AACCTCCTCATCATGGTCACCGTGACCTGTGAGTCTCGCCTTCACACCCCCATGTACTTCCT 

GCTCCGCAATCTAGCCATCCTrGACATCTGCTTCTCCTCCACAACTGCTCCTAAAGTCTTGC 

^ACCTTCTGTCAAAGAAAAAGACCATATCCTATACAAGCTGCATGACACAGATATrTCT 

CTTCCACCTCCTTGGTGGGGCAGACATTTTTTCTCTCTCTGTGATGGCGTTTGACTGCTACA 

TGGCCATCTCCAAGCCCCTGCACTATGTGACCATCATGAGTAGAGGGCAATGCACTGCCCT 

CATCTCTGCCTCTTGGATGGGGGGCTTTGTCCACTCCATCGTGCAGATCTCCCTGTTGCTGC 

CTCTCCCTTTCTGTGGACCCAATGTTCTTGACACTTTCTACTGCGATGTCCCCCAGGTCCTC 

AAACTCACTTGCACTGACACTTTTGCTCTTGAGTTCrTGATGATTTCCAACAATG 

CACTACCCTGTGGTTTAT(^CCTGCTTGTGTCCTACACAGTCATCCTAATGACGCTGAGGT 

ctcaggcaggagggggcaggaggaaagccatctccacttgcacctccccacatcactgtg 

GTGACCCTGCATTTTGTGCCCTGCATCTATGTCTATGCCCGGCCCTTCACTGCCCTCCCCAC 

AGAAAAGGCCATCTCTGTCACCTTCACTGTCATCTCCCCTCTGCTGAACCCTTTGATCTACA 

^^^^ CCAGG ^ TG ^ GTCAGCCATGAG ^ GACTG ^ G AGAAGACTCGTGCCTT 
CTGA (SEQ ID NO: 278) 



AOLFR151 sequences: 

J^SPNHTWTEFILLGLTDDPVLEKILFGVFIAnn^ITLAGNLC]vnLLIRTNSHLQTP 
FVDICYSSmrTPNMLHNFLSEQKTISYAGCFTQCLLFlALVlTEFYILASMALDRYVAICSPLHYS 
SRMSKMCVCLVTIPYMYGFLSGFSQSLLTFHI^FCGSLEINHFYCADPPLIMLACSDTRVKKMA 
MFWAGFNI^SSLFimLSYLFIFAAIFRmSAEGRHKAFSTCASHLTIVTLFYGTLFCMYVR^ 
KSVEESKITAVFYTFLSPMLNPLIYSLRNTDVILAMQQMIRGKSFHKIAV (SEQ ID NO: 279)



ATGTrCrCCCCAAACCACACCATAGTGACAGAATTCATTCTCTTGGGACTGACAGACGACC 

CAGTGCTAGAGAAGATCCTGTTTGGGGTATTCCTTGCGATCTACCTAATCACACTGGCAGG 

CAACCTGTGCATGATCCTGCTGATCAGGACCAATTCCCACCTGCAAACACCCATGTATTTC 

TrCCTTGGCCACCTCTCCTTTGTAGACATTTGCTATTCTTCCAATGTTACTCCAAATATGCT 

GCACAATTTCCTCTCAGAACAGAAGACCATCTCCTACGCTGGATGCTTCACACAGTGTCTT 

CTCTTCATCGCCCTGGTGATCACTGAGTTTTACATCCTTGCTTCAATGGCATTGGATCGCTA 

TGTAGCCATTTGCAGCCCrTTGCATTACAGTTCCAGGATGTCCAAGAACATCTGTGTCTGT 

CTGGTCACTATCCCTTACATGTATGGGTTTCTTAGTGGGTTCTCTCAGTCACTGCTAACCTT 

TCACTTATCCTTCTGTGGCTCCCTTGAAATCAATCATTTCTACTGCGCrGATCCTCCTCTTA 

TCATGCTGGCCTGCTCTGACACCCGTGTCAAAAAGATGGCAATGTTTGTAGTTGCAGGCTT 
TAATCTCTCAAGCTCTCTCTTCATCATT^ 

CAGGATCCGTTCTGCTGAAGGCAGGCACAAAGCCTTTTCTACGTGTGCTTCCCACCTGACA 
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LAHRPGHNVSRTFHIMFANLYIVIPPALNPMVYGVKTKQIRDKVILLFSKGTG (SEQ ID NO: 
271) 

ATGCCATCTGCCTCTGCCATGATCATTTTCAACCTGAGCAGTTACAATCCAGGACCCTTCAT 

TCTG^GTA^G^GAtGCCAGGCCTGGAGCAATTCCATGTGTGGATTGGAATT 

ATCTACATTGTAGCTGTTGTGGGAAACTGCATCCTTCrcrACC^ 

GTCTTCATGAACCCATGTTCTTCTTTCTCTCCATGCTGGCCATGACTGACCTCATC^GT^C 

ACAGCTGGTGTGCCTAAAGCACTCAGTATCTTTTGGCTAGGGGCT 

CAGGATGCCTTACACAAATGTTCITCCTTCACTATAACTTTGTCCTGGA^ 

ATGGCCATGGCATTTGATCACTATGTAGCnATCTGTTCTCCCTTC 

GACTCCCAAGACCATCATCAAGAGTGCTATGGGCATCTCCTTTCGAAGCTTCTGCATCATC 
CTGCCAGATGTATTCTTGCTGACATGCCTGCCTTTCTGCAGGACACGCATCATACCCCACA 
rATACTGTGAGCATATAGGTGTTGCCCAGCTCGCCTGTGCTGATATCTCCATCAACTTCTG 

ACGCACACATCCTCTGTGCTGTCTTrGGCCTTCCCTCCCAAGATGCCTGCCAGAAAGCCCT 
^CAOTGTGG^ 

TCCTCGCCCATCGCTTTGGACACAATGTCTCTCGCACCTTCCACATCATGTTTGCCAAT 
TACATTGTTATCCCACCTGCACTCAACCCCATGGTTTACGGAGTGAAGACCAAGCAGATCA 
GAGATAAGGTTATACTTTTGTTTTCTAAGGGTACAGGATGA (SEQ ID NO: 272) 



AOLFR148 sequences: 

MPTVNHSGTSHTWHLLGIPGLQDQHMWISIPFRSYVTALLGNSLLIFIILTKRSLHFJMYI^C 

M^A^IVLSTCTIPQALAIFWFRAGDISLDROT 

YTCILTOAL^C™ 

25 fS£™5^ 

ffiPPCIHIPLANVCILAPPMLNPIIYGIKTKQIQEQWQFLFIKQKITLV (SEQ ID NO: 273) 

ATGCCTACTGTAAACCACAGTGGCACTAGCCACACAGTCTTCCACTTGCTGGGCATCCCTG 
GCCTACAGGAC^A^CACATGTGGATTTCTATCCCATTCTTCATTTCCT 

ACCTCTTCCTCTGCATGCTGGCTGGAGCAGACATTGTCCTCTCCACGTGCAC^ 
GCCTTAGCTATCTTCTGGTTCCGTGCTGGGGACATCTCCCTGGATCGTTGCATCACT 

ctpcttcatccat 

35 ga^™Wactgtctctctgagaagttatggtacaattttccctatcat^ 
taaaaagattgactttctgccagaataatattattccacacaccttt^ 

CCTA^CCAAATATGCATGTAATGACATTCGAATAAACA'^^ 

atotcgacggtggtcttagatgt^ 

40 T<^GrATCATCATCCTCITITATGGGTCTGGCATCTTCACAATCCTTACCCAGAGGTTTGGA 
CG^CA^ATTCCACCTTGTATCCACATCCCGTTGGCT 

TTGTTTATAAAACAGAAAATAACTTTGGTTTAA (SEQ ID NO: 274) 

AOLFR149 sequences:

MSNASLLTAFILMGLPHAPALDAPLFGWLVVYVLTVLGNLLILLV1RVDSHLHTTMYYF 

SFm^STVTWKIXMTLVFPSGRAISFHSCMAQLYFFHFLGGTECFL^ 

LRyTs^TGRSCTL^^ 
aVeT^TV^FVASGC™ 
50 PGSrLvVDGWAWYTVLTPLLWVVYTLRNKEVKKALLKLKDK^ (SEQ ID NO: 

275) 

ATGTCCAACGCCAGCCTACTGACAGCGTTCATCCTCATG^ 
TGGACGCCCCCCTCTTTGGAGTCTTCCTGGTGGTTTACGT^ 
CCTCATCCTGCTGGTGATCAGGGTGGATTCTCACCTCCACACCACCATGTACT

CCAACCTGTCGTTCATTGACATGTGGTTCTCCACTGTCACGGTGCCCAAATTGCTGATGAC 
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AOLFR145 sequences: 

MSVQYSI^PQFMLLSNlTQFSPIF^TSFPGLEGIKHWlFIPFFFl^miVAISGNCFILIIIKTWRIH 

tpmyyllslialtolglcvstlpttmg 

^A^^YSVHTGQQVVRAGLIVII^^ 

fJ^^L^^GK^PAfflLLMAN^^ 

(SEQ ID NO: 267) 

ATGTCAGTCCAATATTCGCTCAGTCCTCAATTCATGCTGCTATCCAACATrACTCAGTITAG 

GC £^^ CTATCTCAC CAGCTTTCCTGGATTGGAAGGCATCAAACACTGGA^ 

CCTTITrCTTTATGTACATGGTTGCCATCTCAGGCAATTGTTrCArrCTGATC^ 

ACCAACCCTCGTCTGCACACACCCATGTACTATCTACTATCCTTGCTGGCCCTCACTGACCT 
GGGGCTGTGTGTGTCCACGTTGCCCACCACTATGGGGATCTrCT^^ 

ATCTACTTTGGAGCGTGTCAAATCCAGATGTTCTGCATC^Crcrim 

°I GAG 7 GCTCCTCATG ^^ 

^ G J^ A ^ ATCACTGGCCAGCAAGT GGTCAGAGCAGGCCTAATTGTCATCTTCCGGGGACC 

TGTGGCCACTATCCCTArTGTCCTCCTCCTGAAGGCTITrCCCrACTGTGGATCT^ 

TCTCCCACTCATTTTGCCTGCACCAGGAAGTGATACAGCTGGCCTGCACAGATACCACC^T 

CAATAATCTGTATGGACTGATGGTGGTAGTTTTCACTGTGATGCTGGAC^ 

GCACTGTCCTATGGACTCATCCTGCACACAGTAGCAGGCCTGGC 

GCCGTGCCTTTCAGACATGCACCGCTCATCTCTGTGCTC^^ 

ATGGGGCTGTCCCTGGTGCACCGTTTTC^^ 

GGCCMTGTCTACCTTTTTGTGCCTCCCATGCTTAACCCAATCATATACAG^ 

^^^^^ 

AOLFR146 sequences: 
I^LTDLGLTLTTLPTVMQLLWFNVF^^^ 

™ YA 4^7^ GRTG ^^ 

SWYGFALALLniVDPLUWTLILK^^ 

HPJAKHASPLVHVIMANIYLLAPPVMNPirYSVKNKQIQWGMLNF (SEQ ID No! 

ATG I£ CCAGGTGACTAAC ^^ 

GATITGAGGCCTCCCACATCTGGATCTCCATCCCCGTCTGCT 
A JS^ TACCACCATCCTCACTGTCA ™ C ^^ 

atctgtttctctccatgctggccctgacggacctgggtctcaccctcaccaccct^c^:a^:a 

GTCATCCAGCTTCTCrGGTrCAACGTTCGTAGAATCAGCTCTGAG^ 

™tottcatggattctcct™ 

GCTATGTGGCCATCTGCTGTCCCCTCCATTATGCCTCCATCCTCACCAATC 

AGAACTGGGTTAGCCATCATTTGCTGCTGTGTrCTGGCGGTrCTTCCCTCCCm 
CAAGCGACTQCCHTrCTGCCACTCCCACCTTCTCT^^ 

^°2 CCTGGTCTGTGCTGACATCAGGCTCAACA ^ 

GCTCATTATTATCGTGGATCCTCTGCTCATTGTGATCTCCTATACACTrATTCT^ 

TCTTGGGCACAGCCACCTGGGCTGAGCGACTCCGTGCCCTCAATAACTGCCTGTCCCACAT 

TCTAGCTGTCCTGGTCCTCTACATTCCCATGGTTGGTGTATCTATGACTCATCG^ 

CTTTCCCTCAAAAATATGCATTCAAGATGA (SEQ ID NO: 270) 

AOLFR147 sequences: 
^^^^ 

^M^TOLD^AGWKAI^IFWLGAREIT^ 

AIC^LRYTTTLTPKTIIKSAMGISFRSFCin^DVFLLTCLPFCRTPIL^ 

NIWGFCWIMTVISDVILUVSYAHn.CAVFGLPSQDACQKALGTCGSHVCVILMFY^ 
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AOLFRl 43 sequences: ^ 

m^wksapp^ (SEQ ID NO:



263) 



^SmSmCTGOTcLcTACAACCATGITGCGTTrAATGCTTGCCTGGTCCAGAT 
rrrTTGTGGCTATTTC 

STa^gggtS^^ 



rC^TGAA^GTAGCATGTGGAGACATCCATGTTAACAACATTTATGGGCT 

taccTa^ggVatggactcaaot 

20 ATGCT^TCATCAtW 



CTC 
TTTT 



T r^rTr,rAr,TGCTGGCCTTTrATGTGCCCATAATTGCTGTCTCCATGATTCACCGCTTCTGG 

lAA^rTrCTCCACCTGTO 
G^A^A^^ 

TTCCATAAATCCCAGGCCTGA (SEQ ID NO: 264) 
AOLFR144 sequences:

TVT^U^KHVPCV^^^MSNVyLFVPPVLNPLIYSAKTKEIRRAIFRMFHHIKI (SEQ ID NO: 265) 

w?StcScSg^^tgcgagtogagcccagotccatg^cc«tct^act 



SncS^CTCcXTCATGGW 



45 SJpSgctac^acatgtcctcatc^^ 

CACATCAAAATATGA (SEQ ID NO: 266)
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CAAGGCTTATACAAGGTACTTAGACTGGGAGTGAAGGGCACCTGA (SEQ ID NO: 258) 
AOLFR140 sequences: 

MLTLNKTDLIPASFILNGWGLEDTQLWISFPFCSMYWAMVGNCGLLYLIHYEDALHKPMYY 
FLAMLSFTDLVMCSSTIPKALCIFWFHLKDIGFDECLVQ^ 

AICYPLRYSTILTNPVIAKVGTATFLRGVLLI1PFTFLTKJU.PYCRGNILPHTYCDHMSVAKLSCG 

>^VNAIYGLMVALLIGGFDILCITISYTNmJlAWSLSSADARQKAI^CTAHICAIWSYTPAF 
^^SHRFGEHIIPPSCHIWAMYLLIJPTMNPIVYGVKTKQIRDCVI^ 

ATGCTAACACTGAATAAAACAGACCTAATACCAGCTTCATTTATTCTGAATGGAGTCCCAG 

GACTGGAAGACACACAACTCTGGATTTCCTTCCCATTCTGCTCTATGTATGTTGTGGCTAT 

GGTAGGGAATTGTGGACTCCTCTACCTCATTCACTATGAGGATGCCCTGCACAAACCCATG 

TACTACTTCTTGGCCATGCTITCCTrrTACTGACCTTGTTATGTGCTCTAGTACAATCCCTAA 

AGCCCTCTGCATCTTCTGGTTTCATCTCAAGGACATTGGATTTGATGAATGCCTrGTCCAG 

ATGTTCTTCATCCACACCTTCACAGGGATGGAGTCTGGGGTGCTTATGCTTATGGCCCTGG 

ATCGCTATGTGGCCATCTGCTACCCCTTACGCTATTCAACTATCCrCACCAATCCTGTAATT 

GCAAAGGTTGGGACTGCCACCTTCCTGAGAGGGGTATTACTCATTATTCCCTTTACTFTCCT 

CACCAAGCGCCTGCCCTACTGCAGAGGCAATATACTTCCCCATACCTACTGTGACCACATG 

TCTGTAGCCAAATTGTCCTGTGGTAATGTCAAGGTCAATGCCATCTATGGTCTGATGGTrG 

CCCTCCTGATTGGGGGCTTTGACATACTGTGTATCACCATCTCCTATACCATGATTCTCCGG 

GCAGTGGTCAGCCTCTCCTCAGCAGATGCTCGGCAGAAGGCCTTTAATACCTGCACTGCCC 

ACATTTGTGCCATTGTTTTCTGCTATACTCCAGCTTT 

GGGAACACATAATCCCCCCTTCTTGCCACATCATTGTAGCCAATATTTATCTGCTCCTACCA 
CCCACTATGAACCCTATTGTCTATGGGGTGAAAACCAAACAGATACGAGACTGTGTCATAA 
GGATCCTTTCAGGTrCTAAGGATACCAAATCCTACAGCATGTGA (SEQ ID NO: 260)




A^CCAGCACTCTTGGCCACAACATGGAATCTCCTAATCACACTGATGTTGACCCTrCTG 

TCITCTTCCTCCTGGGCATCCCAGGTCTGGAACAATTTCATTTGTGGCTCTCACTCCCTGTG 

TGTGGCTTAGGCACAGCCACAATTGTGGGCAATATAACTATTCTGGTTGTTGTTGCCACTG 

AACCAGTCITGCACAAGCCTGTGTACCTTTTTCTGTGCATGCTCTCAACCATCGACr^ 

GCCTCTGTCTCCACAGTTCCCAAGCTACTGGCTATCTTCTGGTGTGGAGCCGGACATATAT 

CTGCCTCTGCCTGCCTGGCACAGATGTTCTTCATTCATGCCTTCTGCATGATGGAGTCCACT 

GTGCTACTGGCCATGGCCTTTGATCGCTACGTGGCCATCTGCCACCCACTCCGCTATGCCA 

CAATCCTCACTGACACCATCATTGCCCACATAGGGGTGGCAGCTGTAGTGCGAGGCTCCCT 

GCTCATGCTCCCATGTCCCITCCTrATTGGGCGTrTGAACrTCTGCCAAAGCCATGTGATCC 

TACACACGTACTGTGAGCACATGGCTGTGGTGAAGCTGGCCTGTGGAGACACCAGGCCTA 

ACCGTGTGTATGGGCTGACAGCT(K:ACTGTTGGTCATTGGGGTTGACITGTrrTGCATTGG 

TCTCTCCrATGCCCTAAGTGCACAAGCTGTCCTTCGCCTCTCATCCCATGAAGCTCGGTCCA 
^^CTA^ACCTGTGGTTCCCATGTCT^ 

TTCTCCi 1 1 1 1 lACACACCGCITTGGCCATCACGTTCCAGTCCATATTCACATTCTTTTGGC 

AGATCCGTAAAAGAGTTGTCAGGGTGTTTCAAAGTGGGCAGGGAATGGGCATCAAGGCAT 
CTGAGTGA (SEQ ID NO: 262) 
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GTGGCCATCTGTCATCCTCTACATTATGCCACCATCATGACTCAGAGCCAGTGTGTCATGC 
TGGTGGCTGGGTCCTGGGTCATCGCTTGTGCGTGTGCTCTTTTGCATACCCTCCTCCTGGCC 
CAGCTTTCCTTCTGTGCTGACCACATCATCCCTCACTACTTCTGTGACCTTGGTGCCCTGCT 
CAAGTrGTCCTGCrCAGACACCTCCCTCAATCAGTTAGCAATCTTTACAGCAGCATTGACA 
GCCATTATGCTTCCATTCCTGTGCATCCTGGTTTCTTATGGTCACATTGGGGTCACCATCCT
CCAGATTCCCTCTACCAAGGGCATATGCAAAGCCTTGTCCACTTGTGGATCCCACCTCTCA 
GTGGTGACTATCTATTATCGGACAATTATTGGTCTCTATTTTCTTCCCCCATCCAGCAACAC 
CAATGACAAGAACATAATTGCTTCAGTGATATACACAGCAGTCACTCCCATGTTGAACCCA 
TTCATTTACAGTCTGAGAAATAAAGACATTAAGGGAGCCCTAAGAAAACTCTTGAGTAGG 
TCAGGCGCAGTGGCTCATGCCTGTAATCTCAGCACTTTGGGAGGCTGA (SEQ ID NO: 254)
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AOLFR138 sequences: 

MLNFTDVTEFILLGLTSRREWQVLFFIVFLVVYIITVVGNIGMMLLIKVSPQLNSPMYFFLSHLS
FWWFSSNVTPIOVILENLFSDKKTISYADCLAQCFFFlAL\^VEIFIIAAIAFDRYTVlGNPLLY 
1 5 GSKMSRGVCIPJuITFPYIYGFLTSLTATLWTYGLYFCGKIEINHFYCADPPLIKMACAGTFVKEY 
" TMLILAGINFTYSLTVniSYLFILIAILRMRSAEGRQKAFSTCGSHPTAVnFYGTLIFMYLRRPTE 
ESVEQGKMVAVFYTTVIPMLNPMIYSLRNKDVKKAMMKVISRSC (SEQ ID NO: 255)

ATGCTCAATTTCACCGATGTGACAGAGTTCATTCTTTTGGGGCTAACGAGCCGTCGGG^T 
GGCAAGTTCTCTTCTTCATCGTTTTTCTTGTGGTCTACATTATCACCGTGGTGGGCAATATC

Sgcatgatg^gttaatcaaggtcagtcctcagctta^ 

GTCACTTGTCATTTGTTGATGTGTGGTTTTCrTCCAATGTCACCCCTAAAATGTTGGaAAAT 
CTGTTATCAGATAAAAAAACAATTTCTTATGCTC 

TGCTCTTGTCCATGTGGAAATTTTTATTCTTGCTGCGATTGCCTTTGATAGATACACAGTGA 
25 TTGGAAATCCTTTGCrTTATGGCAGCAAAATGTCAAGGGATGTCTGTATTCGACTGATTAC 
TTTCCCTTACATTTATGGTTTTCTGACGAGTCTGACAGCAACATTATGGACTTATGGCnTGT 
ACTTCTGTGGAAAAATTGAGATCAACCATTTCTACTGTGCAGATCCACCTCTCATCAA^T 

GGCCTGTGCCGGGACCTTTGTAAAAGAATATACAATGCT^ 

ACATATTCCCTGACTGTAAITATCATCTCTTACTTATTCATCCTCATTGCCATTCTGCGAAT 
GCGCTCAGCAGAAGGAAGGCAGAAGGCCTTTTCCACATGTGGGTCCCATCTGACAGCTGT
CATCATATTCTATGGTACTCTGATCTTCATGTATCTCAGACGTCCCACAGAGGAGTCTGTG 
GAGCAGGGGAAGATGGTGGCTGTGTTCTATACCACAGTGATCCCCATGTTGAATCCCATGA 
TCTACAGTCTGAGGAACAAGGATGTGAAAAAGGCCATGATGAAAGTGATCAGCAGATCAT 

GTTAA (SEQ ID NO: 256) 



AOLFR139 sequences: 

MGFPGIHSWQHWLSLPLALLYLLALSANILILIIINKEAALHQPMYYFLGILAMADIGIATTIMP 
KILAILWFNAKTISLLECFAQMYAIHCFVAMESSTFVCMAIDRYVAICRPLRYPSIITESFVFKAN 

GFMALRNSLCLISWLLAAQRHYCSQNQIEHCLCSNLGVTSL^ 

DLGLIII^YALILYSVLKLNSPEAASKALSTCTSHLILILFFYTVimSITRSTGMRWLIPVLIjm. 
HNVBPP ALNPMVYALKNKELRQGLYKVLRLGVKGT (SEQ ID NO: 257) 



ATGGGATTCCCTGGCATTCACAGTTGGCAGCACTGGCTCTCCCTGCCCCTGGCTCTGCTCT 
ACCTCTTAGCTCTCAGTGCCAACATCCTTATCCTGATCATCATCAACAAAGAGGCAGCACT 

45 GCACCAGCCTATGTACTATrTCCTGGGCATCTTGGCTATGGCAGACATAGGCCT 

ACCATCATGCCTAAGATTTTGGCCATCTTATGGTTCAATGCTAAGACCATCAGTCTCCTGG 
AGTGCTTTGCTCAGATGTATGCCATACATTGCTTTGTGGCCATGGAATCAAGTACCTTTGT 
CTGCATGGCTATTGATAGATATGTAGCCATTTGTCGACCGCTACGATATCCATCAATCATC 
ACTGAATCrTTTGTTTTCAAAGCAAATGGGTTCATGGCACTGAGAAACAGCCTGTGTCTCA 

50 TCTCAGTGCCrCTGTTGGCTGCCCAGAGGCATTACTGCTCCCAGAATCAAATTGAGCACTG 
TCTTrGTTCTAACCrTGGAGTCACTAGCCTATCTTGTGATGATCGAAGAATCAATAGCATT 
AACCAGGTCCTITTGGCTTGGACACTCATGGGAAGTGACCrrGGGTTTGATTATTTTATC^ 
ATGCTCTAATACTTTACTCTGTCCTGAAGCTGAACTCTCCAGAAGCTGCATCCAAGGCCTT 

AAGTACCrGCACCrCCCACCTCATCTTAATCCTTTTCTTCT^^ 
55 CCATTACTCGTAGTACAGGAATGAGAGTTCCCCTTATTCCAGTTCTACTTAATGTGCTACA 
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ATGATTTTCCCTTCTCATGATAGTCAGGCTTTCACCTCCGTGGACATGGAAGTGGGAAATT 

GCACCATCCTGACTGAATTCATC1TGTTGGGTTTCTCAGCAGATTCCCAGTGGCAGCCGAT 

TCTATTTGGAGTGTTTCTGATGCTCTATTTGATAACCITGTCAGGAAACATGACCrTGGTTA 

TCTTAATCCGAACTGATTCCCACTTGCATACACCTATGTACITTTTCATTGGCAATCT 

TTTTTGGATTTCTGGTATACCTCrGTGTATACCCCCAAAATCCTGGCCAGTTGTGTCrCAGA 

AGATAAGCGCATTTCCTTGGCTGGATGTGGGGCrCAGCTGTTTTTTTCCTGTGTrGTAGCCT 

ACACTGAATGCTATCTCCTGGCAGCCATGGCATATGACCGCCATGCAGCAATTTGTAACCC 

ATTGCTTTATTCAGGTACCATGTCCACCGCCCrCTGTACTGGGCTTGT^ 

TAGGAGGATTTTTGAATGCCATAGCCCATACTGCCAATACATTCCGCCTGCATTTTTGTGG

TAAAAATATCATTGACCACTTTTTCTGTGATGCACCACCATTGGTAAAAATGTCCTGTACA 

AACACCAGGGTCTACGAAAAAGTCCTGCTTGGTGTGGTGGGCTTCACAGTACTCTCCAGCA 

TTCTTGCTATCCTGATTTCCTATGTCAACATCCTCCTGGCTATCCTGAGAATCCACTCAGCT 

TCAGGAAGACACAAGGCATTCTCCACCTGTGCTTCCCACCTCATCTCAGTCATGCTCTTCTA 

TGGATCATTGTrGTTTATGTATTCAAGGCCTAG.TTCCACCTACTCCCTAGAGAGGGACAAA 

GTAGCTGCTCTGTTCTACACCGTGATCAACCCACTGCTCAACCCTCTCATCTATAGCCTGAG 

AAACAAAGATATCAAAGAGGCCTTCAGGAAAGCAACACAGACTATACAACCACAAACATG 
A (SEQ ID NO: 250) 

AOLFR136 sequences: 

MTMENYSMAAQFVLDGLTQQAELQLPLFLIJLGI 

SSI^FVDFCYSSVITPKMLVNFLGKJWTILYSECMVQLFFFVWVVAEGYLLTAMAYDRYVAIC 
SPLLYNAIMSSWVCSLLVLAAFFLGFI^ALTHTSAMMKIJSFCKSHIINHYFCDVLPLLNI^CSNT 
HLNELLLFIIAGFNTLVPTLAVAVSYAFILYSILHIRSSEGRSKAFGTCSSHLMAVVIFFGSITFMY 
FKJ>PSSNSLDQEKVSSVFnTTVIPMLOTLIYSLR>JKDVKXALRKVLVGK (SEQ ID NO: 251) 

ATGACCATGGAAAATTATTCTATGGCAGCTCAGTTTGTCTTAGATGGTTTAACACAGCAAG 

CAGAGCTCCAGCTGCCCCTCTTCCTCCTGTTCCTGGGAATCTATGTGGTCACAGTAGTGGG 

CAACCTGGGCATGATTCTCCTGATTGCAGTCAGCCCTCTACTTCACACCCCCATGTACTATT 
TCCTCAGCAGCTTGTCCTTCGTCGATTTCTGCTATC 

GTGAACTTCCTAGGAAAGAAGAATACAATCCTTTACTCTGAGTGCATGGTCCAGCTCTTTT 

TCTTTGTGGTCTTTGTGGTGGCTGAGGGTTACCTCCTGACTGCCATGGCATATGATCGCTA 

TGTTGCCATCTGTAGCCCACTGCTTTATAATGCGATCATGTCCTCATGGGTCTGCTCACTGC 

TAGTGCTGGCTGCCTTCTTCTTGGGCTTTCTCTCTGCCTTGACTCATACAAGTGCCATGATG 

AAACTGTCCTTTTGCAAATCCCACATTATCAACCATTACTTCrGTGATGTTCTTCCCCT 

CAATCTCTCCTGCTCCAACACACACCTCAATGAGCTTCT 

ACACCrTGGTGCCCACCCTAGCTGTTGCTGTCTCCTATGCCTTCATCCTCTACAGCATCOT 

CACATCCGCTCCTCAGAGGGCCGGTCCAAAGCTTTTGGAACATGCAGCTCTCATCTCATGG 

CTGTGGTGATCrrTCrTTGGGTCCATTACCTrCATGTATTTCAAGCCCCCrTCAAGTAACTCC 

CTGGACCAGGAGAAGGTGTCCTCrGTGTTCTACACCACGGTGATCCCCATGCTGAACCCTT 

TAATATACAGTCTGAGGAATAAGGATGTGAAGAAAGCATTAAGGAAGGTCTTAGTAGGAA 
AATGA (SEQ ID NO: 252) 

AOLFR137 sequences: 

MSPENQSSVSEFLLLGLPIRPEQQAVFFAJLFLGMYLTTV^ 

IALTDISFSSVTVPKMLMNMQTQHIAVFYKGCISQTYFFIFFADLDSFLITSMAYDRYVAICHPL 

HYATIMTQSQCVMLVAGSWVIACACALLHTLLLAQLSFCADHIIPHYFCDLGALLKLSCSDTSL 

NQLAIFTAALTAIMLPFLCmVSYGmGVmQIPSTKGICKALSTCGSHLSVVTrr^TnGLYFlJ' 

PSSNTNDKNIIASVIYTAVTPMLNPFIYSLRNKDIKGALRKLLSRSGAVAHACNLSTLGG (SEQ
ID NO: 253)

ATGAGCCCTGAGAACCAGAGCAGCGTGTCCGAGTTCCTCCTCCTGGGCCTCCCCATCCGGC 
CAGAGCAGCAGGCCGTGTTCTTCGCCCTGTTCCTGGGCATGTACCTGACCACGGTGCTGGG 
GAACCTGCrCATCATGCTGCTCATCCAGCTAGACTCTCACCTTCACACCCCCATGTACTTCT 
TCCTTAGCCACTTGGCCCTCACTGACATCTCCnTTTCATCTGTCACTGTCCCTAAGATGCTG 
ATGAACATGCAGACTCAGCACCTAGCCGTCTTTTACAAGGGATGCATTTCACAGACATATT 
TTTTCATATTTTTTGCTGACTTAGACAGTTTCCTrATCACTTCAATGGCATATGACAGGTAT 
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WOVCTVLVGIAWVGGFMHSFAQILLIFHLLFCGPKVINHYFCDLWLIJO^ACSDTFLIG 
G^TLSvisFG^LAS 

DKMVAWYTVITAILNPVIYSLRNAEMRKAMKRLWIRTLPJLNEK (SEQ ID NO: 245) 
ATGACTGAATTCATTTTTCTGGTACTTTCTCCCAACCAGGAGGTGCAGAGGGTTTGCTTTG

^gata^tgtcWaca 

ATCACCAGCA^AAGCCTTG^^ 

GATCTGCTACrCCTCCGCrACAGCCCCCAAACTCATCTCAGATCrGCTGGCTGAAAGGAAA 

GTCATATCTTGGTGGGGCTGCATGGCACAGCTTTTCTTCTTGCAOT 
GATTTTCCTGCTCACTGTGATGGCCTATGACCACTATGTGGCCATCTGCAAGCCCCTCAGC
TA^CACCACCATCATGAACTGGCAGGTGTGTACTGTCCTTGTAGGAATAGCATGGGTCGGA 

GGCTTCATGCATTCCTTTGGACAA 
TGTX3ATCAATCACTATTTCT 

TCCTCATrGGTCTGCTGA^GTTGCCAATGGAGGCACCCTGTCT 
1 5 CCTCTTAGCATCCTATATGGTCATCTTGCTCCATCTGAGAACCTGGAGCTCTGAAGGGTGG 

?Saa1gccct^ 

cgtcftcaactctctgaggccttctaccactctgcccatagacaagAtggtgg 
acISg^^c^gcg^ 

GAAGGCCATGAAGAGGCTGTGGATTAGGACATTGAGACTAAATGAGAAATAG (SEQ ID 
NO: 246)

AOLFR134 sequences: 

MTTHLEVDNHTVTTRFILLGFPTRPAFQLLFFSIFL^ 

• sISiSmwyvwispkmlvdfi^ 

25 N^mYPVIMTOQLCGTLAGGC^ 
DAWAEM^EFLALMVIAIPLCV^ 
^A^LMYAYNSNKWSVLYTVIVPLLNPir^ 

S (SEQ ID NO: 247) 

ATGACCACCATAATTCTGGAAGTAGATAATCATACAGTGACAACACGTTTCATTCTTCTGG
GGmCCAACACGACCAGCCTrCCAGCTTCTCTTTTTC^ 
CTGlcACTCCTGGAGAATCnTCTTATCATCTTAGCT^^ 

agcccatgtacScttcttgagccacctctccttcct^ 

iGCCCCAAGATGOTGTTGACrrCCTCAG 
35 TGACTCAACTTTCACITTTTTGTC 

gccSgIc^I^^ 

AGCTCTGTGGCACACTGGCTGGAGGATGCTGGTTCTGTGGACTCATGACT 
GATGGTTTTTATAGCACAACnrTCACTACTGTGGCATGCCTCAGATCAATCACTACTTTT^^ 

• ATATCTCTCCACTCCTrTAACGTCTCCTGTGAGGATGCCTCACAGGCTGAGATGGTGGA 

40 OTOTGGCCCTCATGGTCArrGCTATTCCTCITrGTGTT 

Tnr.rTCCCACCTGACCGTCGTAATTCTCTTCTATTCCATGACACrTTTCACCTATGCCCGTC 
CCAA^CTCA^GTATGCCT 

ccaCtcctc^\accccaVcatttactgtctgaggaaccatgaagtaaaggcag 

45 5£JS^ 
ID NO: 248) 



AOLFR135 sequences: 

mif'pshdsqaftsvdmevgnctiltefillgfsadsqwqpilfgwlmlylitlsgnm^™rt 

SlHTpS^ 

A AMA^^RHA^CNP LLYSGTMST ALCTGL V AG S Y IG GFLN AIAHTANTFRLHFCGKNIIDHFFC 
T (SEQ ID NO: 249) 
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CCGTTCCACnTTGACTGCTCACATCACAGTGGTGATTCT^ 

TCTACATTTGGCCCrTCGGCAACCACTCTGTAGATAAGTTCCTTGCTGTGTTTTATACCATr 

ATCACTCCTATCTTGAATCCAATTATCTATACTCTGAGAAACAAAGAAATGAAGATATCCA 
TGAAAAAACTCTGGAGAGCTTTTGTGAATTCTAGAGAAGATACTTAG (SEQ ID NO: 240) 

AOLFR131 sequences:

MA STShA/TELIFTGLFQDP A VQS VCFVWLP VYIATWGNGLIVLTVSISKSLDSPM YFFLSCL<? 
LVEISYSSTlAPKTimLLAKIKTlSLEGCLTQIFFPHFPGVAEILLIVVMAYDCWAICKPLHYMNT 
ISRQLCHLLVAGSWLGGFCHSIIQILVnQLPFCGPNVIDHYFCDLQPLFKlJVCTDTFMEGVlVLA 
NSGLFSWSFLILVSSYIVIL\^RNHSAEGRHKALSTCASHITVVILFTGPAIFLYMPJ>SSTFTED 
KLVAWYTV1TPMLNPUYTLRNAEVK1AIPJILWSKKENPGRE (SEQ ID NO: 241) 

ATGGCCAGTACAAGTAATGTGACTGAGTTGATTTTCACTGGCCTTTTCCAGGATCCAGCTG 

TGCAGAGTGTATGCTTTGTGGTGTTTCTCCCCGTGTACCTTGCCACGGTGGTGGGCAATGG 

CCrCATCGTTCTGACGGTCAGTATCAGCAAGAGTCrGGATTCTCCCATGTACTTCTTCCTTA 

GCTGCCTGTCCTTGGTGGAGATCAGTTATTCCTCCACTATCGCCCCTAAATTCATCATAGAC 

™ C IE GCCAAGA ™ AAACCATeTCT ^ 

CITCrTTGGGGTTGCTGAGATCCTTTTGATTGTGGTGATGGCCTATGATTGCTACGTGGCC 

ATTTGCAAGCCTCTTCATTATATGAACATTATCAGTCGTCAACTGTGTCACCTrCTGGTGGC 

TGCTrCCTGGCTGGGGGGCITTrGTCACTCCATAATTCAGATTCTCGTrATCATCC 

CCTTCTGTGGTCCCAATGTGATTGACCACTATTTCTGTGACCTCCAGCCTTTATTCAAGCTT 

GCCTGCACTGACACCTrCATGGAGGGGGTTATTGTGTTGGCCAACAGTGGATTATTCTCTG 

TC^CrCCTTCCrCATCTTGGTGTCCTCrrATATTGTCATTCTGGTCAACrTGAGGAAC^T 

TCTGCAGAGGGGAGGCACAAAGCCCTCTCCACCTGTGCTTCTCACATCACAGTGGTCATCT 

TGTTTTTTGGACCTGCTATCTTCCTCTACATGCGACCTTCTTCCACTTTCACTGAAGATAAA 

CTTGTGGCTGTATTCTACACGGTCATCACCCCCATGCTGAACCCCATCATTTACACACTCAG 

GAATGCAGAGGTGAAAATCGCCATAAGAAGATTGTGGAGCAAAAAGGAGAATCCAGGGA 
GGGAGTGA (SEQ ID NO: 242) 

AOLFR132 sequences: 

MVATNNVTEIIFA'GFSQNWSEQRVISVMFLLMYTAVVLGNGLrVVTIL^ 

SFVEICY CS^^yL^PKLIFDSFIKRKVISLKGCLTQMFSLHFFGGTEAFLLMVMAYDRYV AJCKPL 
HYMAIMNQPJVlCGLLVRIAWGGGLmSVGQTFLIFQLPFCGPNIMDHYFCD\aiPVLELACADT 
FFISLLIITNGGSISWSFF\^MASYLin.HFLRSHNLEGQHKAI^CASHVTVVDLITIPCSLVYIR 
PCVTLPADKJVAWYTWTPLLOTVIYSFRNAEVKNAMRRFIGGKVI (SEQ ID NO: 243) 

ATGGTTGCTACAAACAATGTGACTGAAATAATTTTCGTGGGATTTTCCCAGAATTGGAGTG 
AGCAGAGGGTCATTTCTGTGATGTrTCTCCTCATGTACACAGCTG'rTGTGCTGGGCAATGG 
CCrCATTGTGGTGACCATCCrGGCCAGCAAAGTGCrCACCrCCCCCATGTATTTCTTTCTCA 
GCTACTTATCCTTTGTGGAGATCTGCTACTGTTCTGTCATGGCCCCCAAGCTTATCTTTGAC 
TCCTTTATCAAGAGGAAAGTCATTTCTCTCAAGGGCTGCCTCACACAGATGTTTTCCCTCC

A 7I!!^ GGTGGCACTGAGGC 

CATCTGCAAGCCCTTGCACTACATGGCCATCATGAACCAGCGAATGTGTGGTCTCCTCGTG 

AGGATAGCATGGGGCGGGGGCCrGCTGCATTCTGTTGGGCAAACCTTCCTGATTTTCCAGC 

TCCCGTTCTGTGGCCCCAACATCATGGACCACTACTrCTGTGATGTCCACCCAGTGCTGGA 
G °I GGCCTGCGCAGACACC ^ OTCA ^ AGCCTGCT GAT^^ 

TCCGTAGTCAGTTTCTTCGTGCTGATGGCTTCCTACCTGATCATCCTGCACTTCCTGAGAAG 

CCACAACTTGGAGGGGCAGCACAAGGCCCTCTCCACCTGTGCCTCTCATGTCACAGTTGTC 

GACCTGTTCrTCATACClTGCTCCTTGGTCTATATTAGGCCCTGTGTCACCCTCCCTGCAGA 

CAAGATAGTTGCTGTATTTTATACAGTGGTCACACCTCrCTTAAACCCTGTGATTTACTCCT 

rcAGGAATGCTGAAGTGAAAAATGCCATGAGGAGATTTATTGGGGGAAAAGTAATTTGA 
(SEQ ID NO: 244) 

AOLFR133 sequences: 

MTEFIFLVI^PNQEVQRVCFW^ 
SATAPKLISDLLAERKVTSWWGCMAQ 
GGCATATGATCGCTATGTAGCCATATGCAAACCTTTACTATATCCAGTGATTATGAACAAT
TGAAGTCCTTATATTCAGATTAACCTTCTGCAATTCTAACATAATACATCATTTTTACTGTG 

atatta?a^cactgtttatgatttcctgtactgaccct^ 

TTTTGTCTGGCTCAATTCAGGTATTCACCATTGTGACAGTTCTTAATTCTTACACA^G^CT
CmCACAATCCTAAAAAAGAAGTCTGTTAGAGGCGTAAGGAAAGCCm 
GAGCCC^CTCTTATCTGTCTCTTrATATTATG 

gca?ct^cSaagcagatgac^ 

AATGGTAAAAAGAAATGTTTAG (SEQ ID NO: 236)
AOLFR128 sequences: 

METONLTVVTEFILLGLTQSQDAQLLWVLVLIFYLIILPGNFLIIFTIKSDPGLTAPLYFFLGl^A 
LLDaS 

ADKWSLFHTVIFPLMNPVIYTLRNQEVKASMRKLLSQHMFC (SEQ ID NO: 237) 

ATGGAAACACAGAACCTCACAGTGGTGACAGAATTCATTCTTCTTGGTCTGACCCAGTCTC 
AAGATGCTCAAC^TCTGGTCTTTGTGCTAGTCTrAATTT^

^^GCAACTTGGCCTTACTGGATGCATCCTACTCCTTCATTGTC^ 

GTCGA^CTCTCTGAGAAGAAGGTAATCTCCTATAGAAGCTGCATCACTCAGCTCm 

Itoccatctgccggcctitacactattcaaccatcatgaaco

^g^ggttctgtggcttgggggctttatccattccattgtacaagt^^ 
cacSgcc^ctgtggcccaaaccagctcgataacttcitctgtgatgttcca 

ca^ggc^ 

ctcagcctcctctgcttcctgggccttctggcctcctatgcagtcatcctct 
Sagcactcctctgaaggaaagagcaaggctatt^
a?at^ctca7g^ggacctgcta 
tgacaagctagtttctcttttccatact 

cgcttoocaaccagga 
ctga (SEQ ID NO: 238)


AOLFR129 sequences: 

MALYFSLILHGMSDLFTLSTGHPRASCRMEAMKLLNQSQVSEFILLGLTSSQDVEFLLFALFS^I 

y^™nll^vfntpnlntpmyfllgnlsfvdmtlasfatpkv^ 

GQSKAP^TLTAHITVVILFFGPCIFrfrWPFGNHSVDKFLAVFYTn 
KLWRAFVNSREDT (SEQ ID NO: 239) 

ATGGCTCTTTATTTTTCACTCATACTCCAT 
ac TrATCCAAGAGCGAGCTGTAGGATGGAGGCCATGAAACTATTAAATCAATCTCAAGTGTC 

IgIa^^gctggg^ctgaccagct^ 

TCTCGGTOTCTATGTGGTCACAGTTTTGGGTAACCTTCTTATTATAGTCACAG 
A^CCCTA^CCTGAATACTCCCATGTATTTTCTCCnTGGTAATCTCT 

COTGCTTCrTTTGCCACCCCTAAGGTGATTCTGAACTTGTTAAAAAAGCAGAAGGTAA^ 

TCTTTTrGCTGGGTGCTTCACTCAGATATTTCTCCT^
ACTGTTGGTCTCCATGGCITITGACAGATATGTGGC 
ACCATOATGAACAAGAAGGTATGTGTTTTGCTTGTAGTGACCT 
^CACrCAGGGTnnrcAGATACCATTTGCTGTGA^ 

gIcagcatt^ 

acaggtagtcattgttgccaacagtggcataatctccctgagctgttt
tctcctacagtctgatcctcataaccattaagaaccact^ 
ATGACCAATCAGACACAGATGATGGAATTCTTGCITGTGAGATTTACTGAGAATTGGGTGC

TCCTGAGGCTGCATGCTTTGCrCTTCTCACTGATCTACCTCACGGCTGTGCTGATGAATTTA 

GTCATCATTCTCCTCATGATTCTGGACCATCGTCTCCACATGGCAATGTACTTTTTCCTCCG 

ACATTTGTCCTTCTTAGACCTGTGTCTCATTTCTGCCACAGTCCCCAAATCCATCCTCAACT 

CTGTCGCCTCCACTGACTCCATCTCCTTCCTGGGGTGTGTGTTGCAGCTCTTCTTGGTGGTA 

CTGCTGGCTGGATCAGAGATTGGCATCCTTACTGCCATGTCCTATGACCGCTATGCTGCCA 

TCTGCTGCCCCCTACACTGTGAGGCTGTCATGAGCAGAGGGCTCTGTGTCCAGTTGATGGC 

TCTGTCCTGGCTCAACAGAGGGGCCrTGGGACrCTTGTACACAGCTGGAACATTCTCTCTG 

AATTTTTATGGCTCTGATGAGCTACATCAGTTCTTCTGCGATGTCCCTGCCCTACTAAAGCT 

CACTTGTTCTAAAGAACATGCCATCATTAGTGTCAGTGTGGCCATTGGGGTCTGTTATGCA 

TTTTCATGTTTAGTTTGCATTGTAGTTTCCTATGTGTACATTTTCTCTGCTGTGTTAAGGAT 

ATCACAGAGACAGAGACAATCCAAAGCCTTTTCCAACTGTGTGCCTCACCTCATTGTTGTC 

ACTGTGTTTCTTGTAACAGGTGCTGTTGCTTATTTAAAGCCAGGGTCTGATGCACCTTCTAT 

TCTAGACTTGCTGGTGTCTGTGTTCTATTCTGTCGCACCTCCAACCTTGAACCCTGTTATCT 

ACTGTCTGAAGAACAAGGACATTAAATCCGCTCTGAGTAAAGTCCTGTGGAATGTTAGAA 

GCAGTGGGGTAATGAAAGATGACTAA (SEQ ID NO: 232) 

AOLFR126 sequences:

MFLYLCFIFQRTCSEEMEEENATLLTEFVLTGFLHQPDaaPLFIAFL\aYLITlMGNLGLIVLm 
KDPHLHffMYLFLGSLAFVDASLSSTVTPKMLINFL^ 

ATMAYDRYVAICKALLYPVIMTNELCIQLLVLSFIGGLLHALmEAFSFPXTFCNSNIIOHFYCDn 
PLLKISCTDSSIOTLA-fVTIFAGSVQVFTIGTILISYTn^ 

YGPLTCKYLGSASPQADDQDMMESLFTT\OWLL^^ (SEQ ID NO: 233)

ATGTTCCTTTACCTTTGCTTCATTTTTCAGAGGACATGCAGTGAGGAGATGGAAGAGGAAA 

ATGCAACATTGCTGACAGAGTTTGTTCTCACAGGATTTTTACATCAACCTGACTGTAAAAT 

ACCGCTCTTCCTGGCATTCTTGGTAATATATCTCATCACCATCATGGGGAATCTTGGTCTAA 

TTGTTCTCATCTGGAAAGACCCTCACCTTCATATCCCAATGTACITATTCCrTGGGAGTTTA 

GCCTTTGTGGATGCrTCGTTATCATCCACAGTGACTCCGAAGATGCTGATCAACTTCTTAG 

CTAAGAGTAAGATGATATCTCTCTCTGAATGCATGGTACAATTTTTTTCCCTTGTA^ 

GTAACCACAGAATGTTTTCTCTTGGCAACAATGGCATATGATCGCTATGTAGCCATTTGCA 

AAGCITTACrrTTATCCAGTCATTATGACCAATGAACTATGCATTCAGCTATTAGTCTTGTCA 

TTTATAGGTGGCCTTCTTCATGCTTTAATCCATGAAGCTTTTTCATTCAGAITAAC 

TAATTCCAACATAATACAACACrTTTACTGTGACATTATCCCATTGTTAAAGATTTCCTGTA 

CTGATTCCTCTATTAACTTTCTAATGGTTTTTATTTTCGCAGGTTCT 

rrGGAACTATTCTrATATCTTATACAATTATCCTCTTTACAATCrTAGAAAAGAAGTCTATC 

AAAGGGATACGAAAAGCTGTCTCCACCTGTGGGGCTCATCTCTTATCTGTATCTTTATACT 

ATGGCCCCCTCACCTTCAAATATCTGGGCTCTGCATCTCCGCAAGCAGATGACCAAGATAT 

GATGGAGTCTCTATTTTACACTGTCATAGTTCCITrATTAAATCCCATGATCTACAGCCTGA 

GAAACAAGCAAGTAATAGCTTCATTCACAAAAATGTTCAAAAGCAATGTTTAG (SEQ ID NO: 234)



AOLFR127 sequences: 

MSNEDMEQDNTTLLTEFVLTGLTYQPEWKMPLFL 

YFFLGSIAFVDAWISSTVTPKMLVNFIAXNRMISLSECMIQFFSFAFGGTTECF^ 
VAICKPLLYPVIMNNSLCIRLI^SFLGGFIJiALmEVLIFRLTFC^SMIHHFYCDriPLFNnSCTO 
PSINFLMVFILSGSIQVFTTVTVLNSYTFALmLKIOCSVRGVRKAFSTCGAmLSVSLYYGPLIF 
MYlJtPASPQADDQDMIDSVFYTlIIPLLNPIIYSLRNKQVIDSFTKMVKP^^ (SEQ ID NO: 235) 

ATGTCGAATGAGGACATGGAACAGGATAATACAACATTGCTGACAGAGTTTGTTCTCACA 

GGACTTACATATCAGCCAGAGTGGAAAATGCCCCTGTTCTTGGTGTTCTTGGTGATCTATC 

TCATCACTATTGTGTGGAACCTTGGTCTGATTGCTCTTATCTGGAATGACCCACAACTTCAC

ATCCCCATGTACTrTTTTCTTGGGAGTTTAGCCTTTGTTGATGCTTGGATATCTTCCACAGT 

AACTCCCAAAATGTTGGTTAATTTCTTGGCCAAAAACAGGATGATATCTCTGTCTGAATGC

ATGATTCAATTTTTTTCCTTTGCATTTGGTGGAACTACAGAATGTTTTCTCTTGGCAACAAT
MMLPFCGNNVIDHITCEILALLKLVCSDITINVLIM

GRKK^ST^AHSIWILFYGSALFMYMKPKSKNTNTSDEIIGLSYGVVSP 
KEAVKKVLSRHLHLLKM (SEQ ID NO: 227) 

ATGTACAGATTTACAGATTTTGATGTATCAAACAT

CTATACTACCCAGCAGGCAGGTGACCTAGAACACATGGAGACAAGAAATTACTCTGCCAT 
GACTGAATTCTTTCTGGTGGGGCTTTCCCAATATCCAGAGCTCCAGCTTTTTCTGTTCCTGC 
TCTGCCTCATCATGTACATGATAATCCTCCTGGGAAATAGCCTCCTCATTATCATCACCATC 
TTGGATTCTCGCCTCCATACTCCCATGTATTTCTTTCnTGGAAACCTCTCA 
CTGTTACACATCCTCATCCATTCCTCCAATGCTTATTATATTTATGTCTGAGAGAAAATCCA
TCTCCTTCA^GGCTGTGCTCTGCAGATGGTTGTGTCCCITGGCTrGGGCT 
GTCCTCCTGGCTGTGATGGCCTATGACCACTATGTGGCCATCTGCAACCCACTGAGGTACT 
CCATCATCATGAACGGAGTGCTGTATGTGCAAATGGCTGCATGGTCCTGGATCATAGGCTG 
TCTGACCTCCCTATTGCAAACAGTTCTGACAATGATGTTGCCTTTCTGTGGGAATAATGTC 

ISgATCATA™^

CAATGTGCTT^CATGACAGTGACAAATATTGTTTCACT 

TCATCTCCTATGTGTTTATTCTCTCTTCCATCCTGAGAATTAATTGTGCTGAGGGAAGAAAG 
AAAGCCTTTCTCT^ 

T^^^Si^O^GA^CCCAAGTCAAAGAACACTAATACATCTGATGAGATTATTGGGCTG 
TCTTATGGAGTGGTAAGCCCAATGTTAAATCCCATCATCTATAGCCTCAGGAATAAAGAGG
TC/J^G/^GCTGTAAAGAAAGTCCTGAGCAGACATCTGCATTTATTGAAAATGTGA (SEQ ID NO: 228)



AOLFR124 sequences:

MhMSVVTEFIILGLTKKPELQGIIFLFFLIVYLVAFLGN^ 

DnCTTSIffKMLGTMLTSENTISYAGCMSQLFLFTWSLGAEMVLFTTMAYDRYVAICFPLH 
VMNHHMCVALLSMVMAIAVTN 

MVY^I^AIGDFILTCISYGFUVAILRIRTVEGKRKAFSTCSS^ 
YTFERDKWAALYTLVTPTLNPMVYSFQNREMQAGIRKVFAFLKH (SEQ ID NO: 229) 



ATGAATCACAGCGTTGTAACTGAGTTCATTATTCTGGGCCTCACCAAAAAGCCTGAACTCC 
AG^GAATTATCTTCCTCTTTTTTCTCATTGTCT 

ACTGGCTGTTGTGG^ATCATCTGCACAACAAGCATCATACCGAAGATGCTGGGGACCAT 
GCTAACATCAGAAAATACCATITCATATGCAGGCrGCATGTCCCAGCTCrTCTrGTTCACA
TG^TCTC^G^GGAGCTGAGATGGTTCTCTTCACCACCATGGCCTATGACCG^^^ 
TITGTTTCCCTCTrCATTACAGTACTGTTATGAACCACCATATGTGTGTAGCCTTGCTCAGC 
ATGGTCATGGCTATTGCAGTCACCAATTCCTGGGTGCACACAGCTCTTATCATGAGG1TGA 
C^CTCTG^GCCAAACACCATTGACCACTTCTTCTGTGAGATACCCCCA 
TCCTGTAGCCCTGTAAGAATCAATGAGGTGATGGTGTATGTTGCTGATATTACCCTGGCCA.

TAGGGGACTTTA^ 

CGCACAGTAGAAGGCAAGAGGAAGGCCTTCTCAAC^TG 

acccWctatcctc^ 

A^GAGA^A^ 

^CAG^CAGAATAGGGAGATC^AGGCAGGAATTAGGAAGGTGTTrGCAlTTCTGAAA
CACTAG (SEQ ID NO: 230) 



AOLFR125 sequences: 
(SEQ ID NO: 231)
GCTCAATCCrrCTCATCTACAGTCTTAGGAACAAGGAAGTGAAGGGAGCTCTGGGTCGAGTC
TTTTCTCTCAAGTTTTGGAAGGGACAGTGA (SEQ ID NO: 222) 






AOLFR121 sequences: 



MKRKNFreVSEFIFLGFSSFGKHQITLFVVFLTVYlLTLVANIIIVTIICro 
SSETVYTLVTWRMUiilJFHNQPre^ 

TVIMSKGLCAQLVCGSFGIGLTMAVIJJVTAMFNIJTCGT^ 

NYGVSSFVIFWIGLIFISYVLN^SSILQIASAEGRKKTFATCVSHLTVVIVHCGCASIAYLKPKSES 
SIEKDLVI^VTYTIITPLLNPVVYSLRNKEVKDAJLCRVVGRNIS (SEQ ID NO: 223)



ATGAAGAGAAAGAACTTCACAGAAGTGTCAGAATTCATTTTCTTGGGATTTTCTAGCrTTG 
GAAAGCATCAGATAACCCTCTTTGTGGTTTTCCTAACTGTCTACATTTTAACTCTGGTTGCT 
AACATCATCATTGTGACTATCATCTGCATTGACCATCATCTCCACACTCCCATGTATTTCTT 
CCTAAGCATGCTGGCTAGTTCAGAGACGGTGTACACACTGGTCATTGTGCCACGAATGCTT 

TTGAGCCTCATTTTTCATAACCAACCTATCTCCTTGGCAGGCTGTGCTACACAAATGTTCTT
TTTTGTTATCTrGGCCACTAATAATTGCTTCCTGCTTACTGCAATGGGGTATGACCGCTATG 
TGGCCATCTGCAGACCCCTGAGATACACTGTCATCATGAGCAAGGGACTATGTGCCCAGCT 
GGTGTGTGGGTCCTTTGGCATTGGTCTGACTATGGCAGTTCTCCATGTGACAGCCATGTTC 
AATTTGCCGTTCTGTGGCACAGTGGTAGACCACTTCTTTTGTGACATTTACCCAGTCATGA 

AACTTTCTTGCATTGATACCACTATCAATGAGATAATAAATTATGGTGTAAGTTCATTTGT

GATTTTTGTGCCCATAGGCCTGATATTTATCTCCTATGTCCTTGTCATCTCTTCCATCCTTC 

AAATTGCCTCAGCTGAGGGCCGGAAGAAGACCTTTGCCACCTGTGTCTCCCACCTCACTGT 

GGTTATTGTCCACTGTGGCTGTGCCTCCATTGCCTACCTCAAGCCGAAGTCAGAAAGTTCA 

ATAGAAAAAGACCTTGTTCTCTCAGTGACGTACACCATCATCACTCCCTTGCTGAACCCTG 

TTGTTTACAGTCTGAGAAACAAGGAGGTAAAGGATGCCCTATGCAGAGTTGTGGGCAGAA
ATATTTCTTAA (SEQ ID NO: 224) 

AOLFR122 sequences: 

MEWENQTILVEFFLKGHSVHPRLELLFFVLIFIMYVVILLGNGTLILISILDPHLHTPMYFFLGNL 
SFLDICTTTTSIPSTLVSFI^ERKTISFSGCAVQMFT.GLAMGTTECVLLGMMAFDRYVAICN^
YPIIMSKNAYWMAVGSWFAGIWSAVQTTFWQU>FC^ 
EFLMLVATILFIXMPLLLIVISYSLnSSI^^ 

K^TLNSDDLDATOKIISMFYGVMTPMMNPLIYSLRNKDVKEAVKHLPNRRFFSK (SEQ ID NO: 225)

ATGGAATGGGAAAACCAAACCATTCTGGTGGAATTTTTTCTGAAGGGACATTCTGTTCACC 

CAAGGCTTGAGTTACTCTTTTTTGTGCTAATCTTCATAATGTATGTGGTCATCCTTCT 

AATGGTACTCTCATTTTAATCAGCATCTTGGACCCTCACCTTCACACCCCTATGTACTrCTT 

Afx TCTGGGGAACCTCTCCTTCTTGGACATCTGCTACACCACCACCTCTATTCCCTCCACACTAG 

TGAGCTTCCTTTCAGAAAGAAAGACCATTTCCTTTTCTGGCTGTGCAGTGCAGATGTTCCTT
GGCTTGGCCATGGGGACAACAGAGTGTGTGCTTCTGGGCATGATGGCCTTTGACCGCTATG 
TGGCTATCTGCAACCCTCTGAGATATCCCATCATCATGAGCAAGAATGCCTATGTACCCAT 
GGCTGTTGGGTCCTGGTTTGCAGGGATTGTCAACTCTGCAGTACAAACTACATTTGTAGTA 
CAATTGCCTTTCTGCAGGAAGAATGTCATCAATCATTTCTCATGTGAAATTCTAGCTGTCAT

GAAGTTGGCCTGTGCTGACATCTCAGGCAATGAGTTCCTCATGCTTGTGGCCACAATATTG
TTCACATTGATGCCACTGCTCTTGATAGTTATCTCTTACTCATTAATCATTTCCAGCATCCT 
CAAGATTCACTCCTCTGAGGGGAGAAGCAAAGCTTTCTCrACCTGCTCAGCCCATCTGACT 
GTGGTCATAATATTCTATGGGACCATCCTCTTCATGTATATGAAGCCCAAGTCTAAAGAGA 
CACTTAATTCAGATGACTTGGATGCTACCGACAAAATTATATCCATGTTCTATGGGGTGAT 

GACTCCCATGATGAATCCTTTAATCTACAGTCTTAGAAACAAGGATGTGAAAGAGGCAGT
AAAACACCTACCGAACAGAAGGTTCTTTAGCAAGTGA (SEQ ID NO: 226) 

AOLFR123 sequences: 

MYRFTDFDVSMSr^NHVLFYTTQQAGDLEHMETRl^SAMTEFFLVGLSQYPELQLFLFLLCL 
IMYMIILLGNSLLramDSRmTTMYFFLGNI^FLDICYTSSSIPPMLIIFMSERKSISFIGCALQM
WSLGLGSTECVIXAVMAYDHWAIC^LRYSIIMNGVLYVQ 
TTGCCAGTTGGTTCTGGCATGGCTTGGAATGGGGAGTGATCTAAGTCrTATTATACTGTCA
TATATTTTGATTCTGTACTCTGTACTTAGACTGAACTCAGCTGAAGCTGCAGCCAAGGCCC 
TGAGCACTTGTAGTTCACATCTCACCCTCATCCTTTTCTTTTACACTATTGTTGTAGTGATT 
TCAGTGACTCATCTGACAGAGATGAAGGCTACTTTGATTCCAGTTCTACTTAATGTGTTGC 
ACAACATCATCCCCCCTTCCCTCAACCCTACAGTTTATGCACTTCAGACCAAAGAACTTAG
GGCAGCCTTCCAAAAGGTGCTGTTTGCCCTTACAAAAGAAATAAGATCTTAG (SEQ ID NO: 218)

AOLFR119 sequences: 

MPLFNSLCWFPTIHVTPPSFILNGIPGLERVHVWISLPLCTMYIIFLVGNLGLVYLIYYEESLHHP
MYFFFGHALSLIDLLTCTTTLPNALCffWFSLKEINFNAClAQMFFVHGFTGVESGVLMLMALD 
RYIAICYPLRYATTLTNPI1AKAELATFLRGW.LNUPFPFLVKPXPFCQSMISHTYCDHMSVVKL 
SCASIKVNV1YGLMVALLIGVFDICCISLSYTLILKAAISLSSSDARQKAFSTCTAH1SAIIITYVPA 
FFTFFAHRFGGHTIPPSLHIIVANLYLLLPPTLNPIWGVKTKQIRKSV1KFFQGDKGAG (SEQ ID NO: 219)

ATGCCTCTATTTAATTCATTATGCTGGTTTCCAACAATTCATGTGACTCCTCCATCTTTTAT 
TCTTAATGGAATACCTGGTCTGGAAAGAGTACATGTATGGATCTCCCTCCCACTCTGCACA 
ATGTACATCATCTTCCTTGTGGGGAATCTTGGTCTTGTGTACCTCATTTATTATGAGGAGTC 

CTTACATCATCCGATGTATTTTTTTTTTGGCCATGCTCTCTCCCTCATTGACCTCCTTACCTG
CACCACCACTCTACCCAATGCACTCTGCATCTTCTGGTTCAGTCTCAAAGAAATTAACTTCA 
ATGCTTGCTTGGCCCAGATGTTCTTTGTTCATGGGTTCACAGGTGTGGAGTCTGGGGTGCT 
CATGCTCATGGCTCTAGACCGCTATATAGCCATTTGCTACCCTTTGCGTTATGCTACCACAC 
TCACCAACCCTATCATTGCCAAGGCTGAGCTTGCCACCTTCCTGAGGGGTGTATTGCTGAT 

GATTCCTTTCCCATrCTTGGTTAAGCGTTTGCCrTTCTGCCAAAGCAATATTATCTCCCA^
CGTACTGCGACCACATGTCTGTAGTAAAGCTATCTTGTGCCAGCATCAAGGTCAATGTAAT 
CTATGGTCTAATGGTTGCTCTCCTGATTGGAGTGTTTGACATTTGTFGTATATCTTTGTCTT 
ACACTTTGATCCTCAAGGCAGCGATCAGCCTCTCTTCATCAGATGCTCGGCAGAAGGCTTT 
CAGCACCTGCACTGCCCATATATCTGCCATCATCATCACCTATGTTCCAGCATTCTTCACTT 

TCTTTGCCCACCGTTTTGGGGGACACACAATTCCCCCTTCTCXrCACATCATTGTGGCTAAT
CTTTATCTTCTTCTTCCCCCAACTCTAAACCCTATTGTTTATGGAGTAAAGACAAAACAGAT 
ACGCAAGAGTGTCATAAAGTTCTTCCAGGGTGATAAGGGTGCAGGTTGA (SEQ ID NO: 220) 

AOLFR120 sequences: 

MQPYTKNWTQVTEFVMMGFAGIHEAHLLFFILFLTMYLFTLVE>n.AIILWGLDHRLRRPMYF
FLTHLSCLEIWYTSVTWKMLAGFIGVDGGKNISYAGCLSQLFIFTFLGATECFLLAAMAYDRY 
VMCMPUIYGAFVSWGTCIPXAAACWLVGFLTPILPIYLLSQLTFCGPNVIDHFSCDASPLLALS 
CSDWWKETVDFLVSLAVLLASSMVIAVSYGN1VWTLLHIRSAAERWKAFSTCAAHLTVVSLF 

YGTLFFMYVQTKVTSSINFNKVVSVFYSVVTPMLW 

(SEQ ID NO: 221)

ATGCAACCATATACCAAAAACTGGACCCAGGTAACTGAATTTGTCATGATGGGCTTTGCTG 
GCATCCATGAAGCACACCTCCTCTTCTTCATACTCTTCCT^ 

GTGGAGAATTTGGCCATCATITTAGTGGTGGGTTTGGACCACCGACTACGGAGACCCATGT 
ATTrCTrCCTGACACACTTGTCCTGCCTTGAAATCTGGTACACTTCTGTTACAGTGCCCAAG
ATGCTGGCTGGTTTTATTGGGGTGGATGGTGGCAAGAATATCTCTTATGCTGGTTGCCTAT 
CCCAGCTCTTCATCTrCACCTTTCTTGGGGCAACTGAGTGTTTCCTACTGGCTGCCATGGCC 
TATGATCGTTATGTGGCCATTTGTATGCCTCTCCACTATGGGGCTTTTGTGTCCTGGGGCAC 
CTGCATCCGTCTGGCAGCTGCCTGTTGGCTGGTAGGTTTCCTCACACCCATCTTGCCAATCT 
ACCTCTTGTCTCAGCTAACATTTTGTGGCCCAAATGTCATTGACCATTTCTCCTGTGATGCC
TCACCCTTGCTAGCCTTGTCGTGCTCAGATGTCACTTGGAAGGAGACTGTGGATTTCCTGG 
TGTCTCTGGCTGTGCTACTGGCCTCCTCTATGGTCATTGCTGTGTCCTATGGCAACATCGTC 
TGGACACTGCTGCACATCCGCTCAGCTGCTGAGCGCTGGAAGGCCTTCTCTACCTGTGCAG 
CTCACCTGACTGTGGTGAGCCTCTTCrATGGCACTCTTTTCTTTATGTATGTCCAGACCAAG 
GTGACCTCCTCCATCAACTTCAACAAGGTGGTATCTGTCTTCTACTCTGTTGTCACGCCCAT
GTGGCCATATGTAAGCCTCTCCACTACCTGACCATCATGAACCCACAAAGGTGCATTTTGT
TTTTAGTCATTTCCTGGATTATAGGTATTATrCACTCAGTGATTCAGTTGGCTriTGTrGTA 
GACCTGCTGTTCTGTGGCCCTAATGAATTAGATAGTTTCTTTTGTGATC^CCTCGATTTAT 
CAAACTGGCTTGCATAGAGACCTACACATTGGGATTCATGGTTACTGCCAATAGTGGATTT 
ATTTCTCTGGCTTCITTTTTAATTCTCATAATCTCTTACATCTTTATTT^ 

AAAAAATCTTCAGGTGGTATATrCAAGGCTTTCTCTATGCTGTCAGCTCATGTCATTGTGG 
TGGTTTTGGTCTTTGGGCCATTAATCTTTTTCTATATTTTTCCATTTCCCACATCACATC^ 
ATAAATTCCTTGCCATCTTTGATGCAGTTATCACTCCCGTTTTGAATCCAGTCATCTATACT 
TTTAGAAATAAAGAGATGATGGTGGCAATGAGAAGACGATGCTCTCAGTTTGTGAATTAr 
AGTAAAATCTTTTAA (SEQ ID NO: 214) 

AOLFR117 sequences: 

NfiWTIWVIKIQIEKSDLKYlVUSLQEISKISLLFW^ 

QHEFWCILFIVFLLIYVTSIMGNSGIILLINTDSRFQTLTYFFLQHIJSJ ? VDICYTS 
EEKNLILFQGCVIQFLVYATFATSDCY1.LAMMAVDPYVAICKP 

MGSINASVQTGFTCSLSFCKSNSINHFFCDVPPIT^I <5CS>A/nrNlTwrT T vwvr.Q^ ictp.t ,nnro 



ATGAATAACACTATTGTATTTGTCATAAAAATACAAATAGAAAAAAGTGACTTGAAATATA 

GAGCCATTTCATTGCAAGAAATCTCAAAGATTTCCCnTCTTTTCTGGGTCCTTCTCTTGGTC 

ATTTCTAGACTTTTACTAGCCATGACACTAGGAAACAGCACTGAAGTCACTGAATTCTATC 

TTCTGGGATTTGGTGCCCAGCATGAGTTTTGGTGTATCCTCITCATTGTATTCCTTCTCATC 

TATGTGACCTCCATAATGGGTAATAGTGGAATAATCTTACTCATCAACACAGATTCCAGAT 

TTCAAACACTCACGTACTTTTTTCTACAACATTTGGCTTTT 

GCTATCACTCCCAAGATGCTCCAAAGCTTCACAGAAGAAAAGAATTTGATATTATTTCAGG 

GCTGTGTGATACAATTCrTAGTTTATGCAACATTTGCAACCAGTGACTGTTATCTCCTGGCT 

ATGATGGCAGTGGATCCTTATGTTGCCATCTGTAAGCCCCTTCACTATACTGTAATCATGT 

CCCGAACAGTCTGCATCCGTTTGGTAGCTGGTTCATACATCATGGGCTCAATAAATGCCTC 

TGTACAAACAGGTTTTACATGTTCACTGTCCITCTGCAAGTCCAATAGCATCAATCACTrTT 

TCTGTGATGTTCCCCCTATTCTTGCTCTTTCATGCTCCAATGTTGACATCAACATCATGCTA 

CTTGTTGTCTTTGTGGGATCTAACTrGATATTCACTGGGTTGGTCGTCATCTTTT 

CTACATCATGGCCACCATCCTGAAAATGTCTTCTAGTGCAGGAAGGAAAAAATCCTTCTCA 

ACATGTGCTTCCCACCTGACCGCAGTCACCATTTTCTATGGGACACTCTCrTACATGTATTT 

GCAGTCTCATTCTAATAATTCCCAGGAAAATATGAAAGTGGCCriTrATATTTTATGGCACA 

GTTATTCCCATGTTAAATCCTTTAATCTATAGCTTGAGAAATAAGGAAGTAAAAGAAGCTT 
TAAAAGTGATAGGGAAAAAGTTATTTTAA (SEQ ID NO: 216) 

AOLFR118 sequences: 

MNHMSASLKISNSSKFQVSEFnLLGFPGfflSWQHWLSIPIALLYI^ALAANT^ 
PMYIFLGJLCMVDMGIATTnPKJIAIFWFDAKVISIJE^ 

VAICHPLRYPSrVTSSLILKATLFMVLRNGLFVTPWVLAAQRDYCSKNEIEHCLCSNLGVTSLA 
CDDRPJ>NSICQLVLAWLGMGSDLSLin^YILILYSVLRLNSAEAAAKAL5TCSSHLTLILFFYTrV 
V^SV^TEMKATLIPVLLNVLHM^ (SEQ ID NO: 217)

ATGAATCATATGTCTGCATCTCTCAAAATCTCCAATAGCTCCAAATTCCAGGTCTCTGAGTT 

CATCCTGCTGGGATTCCCGGGCATTCACAGCTGGCAACACTGGCTATCTCTGCCCCTGGCA 

CTACTGTATCTCTCAGCACTTGCTGCAAACACCCTCATCCTCATCATCATCTGGCAGAACCC 

TTCTTTACAGCAGCCCATGTATATTTTCCTTGGCATCCTCTGTATGGTAGACATGGGTCTGG 

CCACTACTATCATCCCTAAGATCCTGGCCATCTTCTGGTTTGATGCCAAGGTTATTAGCCTC 

CCTGAGTGCITTGCTCAGATTTATGCCATTCACITCrTTGTGGGCATGGAGTCTGGTATCCT 

ACTCTGCATGGCTrTTGATAGATATGTGGCTATTTGTCACCCTCTTCGCTATCCATCAATTG 

TCACCAGTTCCrTAATCTTAAAAGCTACCCTGTTCATGGTGCTGAGAAATGGCTTATTTGTC 

ACrCCAGTGCCTGTGCTTGCAGCACAGCGTGATTATTGCTCCAAGAATGAAATTGAACACT 

GCCTGTGCTCTAACCTTGGGGTCACAAGCCTGGCTTGTGATGACAGGAGGCCAAACAGCAT 
ATGGAAAGAATCAACCACACCAGCAGTGTCTCCGAGTTTATCCTCCTGGGACTCTCCTCCC
GGCCTGAGGACCAAAAGACACTCTITGTTCTCTTCCTCATCGTGTACCTGGTCACCATAA 
AGGGAACCTGCTCATCATCCTGGCCATTCGCTTCAACCCCCATCTTCAGACCCCTATGTAIT 
TCTTCITGAGTTTTCTGTCTCTCACTGATATTTGCTTTACAACAAGCGTTGTCCCCAAGATG 
CTGATGAACrrCCTGTCAGAAAAGAAGACCATCTCCTATGCTGGGTGTCTGACACAGATGT
ATTITCTCTATGCCTTGGGCAACAGTGACAGCTGCCnTCTGGCAGTCATGGCCTTTGACCG 
CTATGTGGCCGTCTGTGACCCTTTCCACTATGTCACCACCATGAGCCACCACCACTGTGTCC 
TGCTGGTGGCCTTCTCCTGCTCATTTCCTCACCTCCACTCACTCCTGCACACACTTCTGCTG 
AATCGTCTCACCTrCTGTGACTCCAATGTTATCCACCACTTTCTCTGTGACCTCAGCCCTGT 

GCTGAAATTGTCCTGCTCTTCCATATTTGTCAATGAAATTGTGCAGATGACAGAAGCACCT
A^GTTTTGGTGACTCGTTITCTCTGCATTGCTTTCTCTTATATACGAATCCTCACTACAGT 
TCTCAAGATTCCCTCTACTTCTGGGAAACGCAAAGCCTTCTCCACCTGTGGTTTTTACCTCA 
CCGTGGTGACGCTCTTTTATGGAAGCATCTTCTGTGTCTATTTACAGCCCCCATCCACCTAC 
GCTGTCAAGGACCACGTGGCAACAATTGTTTACACAGTTTTGTCATCCATGCTCAATCC1T 

TTATCTACAGCCTGAGAAACAAAGACCTGAAACAGGGCCTGAGGAAGCTTATGAGCAAGA

GATCCTAG (SEQ ID NO: 210) 






AOLFR115 sequences: 

MEGFYLWISHELQGMGKPGRWQTTVSDFLLLGLSEWPEEQPLLFGIFLGMYLVTMVGNLLII 
LAISSDPHLHTPMYFFLANLSLTDACFTSASIPKMLANIHTQSQIISYSGCLAQLYFLLMFGGLD 
NCLLAVMAYDRYVAICQPLHYSTSMSPQLCALMLGVCWVLTNCPALMHTLLLTRVAFCAQK 

aTphfy™ps^^^ 

F^TCGSHLTVVLLFYGSLMGVYLLPPST^ 
KLFVSGKTFFL (SEQ ID NO: 211) 



ATGGAAGGTTTTTATCTGCGCAGATCACACGAACTACAAGGGATGGGAAAACCAGGCAGA



GTGAACCAAACCACTGTTTCAGACXTCCTCCTTCTAGGACTCTCTGAGTGGCCAGAGGAGC 
AGCCTCTTCTGTTTGGCATCnTCCTTGGCATGTACCTGGTCACCATGGTGGGGAACCTGCTC 
AT^ATCCTGGCCATCAGCTCTGACCCACACCrCCATACTCCCATGTACTTCTTTCTGGCCAA 
CCTGTCATTAACTGATGCCTGTTTCACTTCTGCCTCCATCCCCAAAATGCTGGCCAACA

aScccagaotcagatcatctcgtattctgggtgtcttgcacagctatatitcct 

TTTGGTG^GCCTTGACAACTGCCTGCrGGCTGTGATGGCATATGACCGCTATGTGjGC 
GCCAACCACTCCATTACAGCACATCTATGAGTCCCCAGCTCTGTGCACTAATGCTGGGTGT 

ctSctSgtcctaaccaactgtcctgccctgatgcacacactg^ 

CTCTGTGCCCAGAAAGCCATCCCTCATTTCTATTC

CTGCTCAGATACCCATGTAAACGAGCTGATGATCATCACCATGGGCTTGCTGTTCCTCACT 

GTTCCCCTCCTGCTGATCGTCTTCTCCTATGTCCGCATTTTCTGGGCT 

ATCTCCTGGAGGGAGATGGAAGGCCrrTCTCTACCTGTGGTTCTCATCTCACGGTGGTTCTG 
CTCTTCTATGGGTCTCnrTATGGGTGTGTATTTACTTCCTCCATCAACTTACTCTACAGAGAG 

GGAAAGTAGGGCTGCTGTTCTCTATATGGTGATTATTCCCACGCTA^^

AGCTTGAGGAACAGAGACATGAAGGAGGCXrTGGGTAAACrTTTTGTCAGTGGAAAAACA 

TTCTTTTTATGA (SEQ ID NO: 212) 



AOLFR116 sequences: 

MDEANHSWSEFVFLGLSDSRKIQLLLFLFFSWYVSSLMGNLLrVE^ 

LSimLWCSSTAPKMr^LFRKHKTISFGGCWQffFIHAVGGTEM^LIAMAITDRWAICKPI^ 
YLTININPQRCILFLVISWIIGIIHSVIQLAFVVDLLFCGPNELDSFFCDLPRFIK1ACIETYTLGFMV 
TANSGFlSLASFLILIISYIFILVTVQKKSSGGIFKAFSMLSAHVrVVVLVFGPLIFFYlFPFPTSHLD 
T<FLAlFDAVITPVLNPVrYrFRNKEMMVAMRRRCSQFVNYSKIF (SEQ ID NO: 213) 



ATGGATGAAGCCAATCACTCTGTGGTCTCTGAGTTTGTGTTCCTGGGACTCTC^ 
GGAAGATCCAGCTCCTCCTCTrCCTCTTTTTCTCAGTGTTCTATGTATCAAGCCTGATGGGA 

AA^OTcX^^^ 

GCTGGCCAACCTTTCCATCATCAATTTGGTATTTTGTTCCTCCACAGCTCCCA 
TATCCAT< 



GCAGTTGGGGGAACTGAGATGGTGCTGCTCATAGCCATGGCTTTTGACCGATAT 
SICLPHLLVVm^TGFIAYIJaPASESP^
KLTKK (SEQ ID NO: 205) 

ATGTGTTATATATATTTAATATTTAAAGA 

TTTCCTGCAGATTACTCCTGCAATAATGGCAAATCTCACAATCGTGACTGAATTTATCCTTA

TGGGGTTTTCTACCAATAAAAATATGTGCATTTTGCATTCGATTCTCITCTTGTTGATTTAT 

TTGTGTGCCCTGATGGGGAATGTCCTCATTATCATGATCACAACTTTGGACCATCATCTCC 

ACACCCCCGTGTATTTCITCTTGAAGAATCTATCTTrCTTGGATCTCTGCCTTATTTCAGTC 

ACGGCTCCCAAATCTATCGCCAATTCTTTGATACACAACAACrcCATTTCATTCCTTGGCTG 

TGTTTCCCAGGTCTTTTTGTTGCTTTCTTCAGCATCTGCAGAGCTGCTCCTCCTCACGGTGA

TGTCCTTTGACCGCTATACTGCTATATGTCACCCTCTGCACTATGATGTCATCATGGACAGG 

AGCACCTGTGTCCAAAGAGCCACTGTGTCTTGGCTGTATGGGGGTCTGATTGCTGTGATGC 

ACACAGCTGGCACCTTCTCCTTATCCTACTGTGGGTCCAACATGGTCCATCAGTTCTTCTGT 

GACATTCCCCAGTTATTAGCTATTTCTTGCTCAGAAAATTTAATAAGAGAAATTGCACTCA

TCCTTATTAATGTAGTTTTGGATTTCTGCTGTTTTATTGTCATCATCATTACCTATGTCCAC

GTCTTCTCTACAGTCAAGAAGATCCCTTCCACAGAAGGCCAGTCAAAAGCCTACTCTATTT 

GCCTTCCACACTTGCTGGTTGTGTTATTTCTTTCCACTGGATTCATTGCTTATCTGAAGCCA 

GCTTCAGAGTCTCCITCTATTTTGGATGCTGTAATTTCTGTGTTCTACACTATGCTGCCCCC 

AACCTITAATCCCATTATATACAGTTTGAGAAACAAGGCCATAAAGGTGGCTCTGGGGATG 
TTGATAAAGGGAAAGCTCACCAAAAAGTAA (SEQ ID NO: 206)

AOLFR113 sequences: 

MKFWHGFSSHLNPMFSSFIXYI^LPWINTTIQAWLN^^ 

SWTCSSSQSSDWMQLCTHLCTTLSVFWSWSCGIQLPLSLRCCLIFS\TUUCPFLLQDASFTU^ 

TPWGACECYLLTAMAYDRYLAICRPLHYPI1MTTTLCAKMAAACWTCGFLCPISEVILASQLPF 

CAYNEIQHIFCDFPPLI^LACKDTSANILVDFAINAFIILITFFFIMISYARUGAVLKIKTASGRKK 

AFSTCASHIAVVUFFGSIIFMYVRLKKSYSLTLDRTLAIVYSVLTPMVNPirV'SLRNK 
TIFQKGDKASLAHL (SEQ ID NO: 207) 

ATGTGTCAACAAATCTTACGGGATTGCATTCITCTCATACATCATTTGTGCATTAACAGGA
AAAAAGTCTCACTTGTGATGCTGGGTCCAGCTTATAACCACACAATGGAAACCCCtGCCTC 
CTTCCTCCITGTGGGTATCCCAGGACTGCAATCTTCACATCTTTGGCTGGCTATCTCACTGA 
GTGCCATGTACATCATAGCCCTGTTAGGAAACACCATCATCGTGACTGCAATCTGGATGGA 
TTCCACTCGGCATGAGCCCATGTATTGCTTTCTGTGTGTTCTGGCTGCTGTGGACATTGTTA 

TGGCCTCCTCGGTGGTACCCAAGATGGTGAGCATCTTCTGCTCAGGAGACAGCTCAATCAG
CTTTAGTGCrTGTTTCACTCAGATGTTTTTTGTCCACTTAGCCACAGCTGTGGAGAGGGGG 
CTGCTGCTGACCATGGCTTTTGACCGCTATGTAGCCATCTGCAAGCCTCTACACTACAAGA 
GAATTCTCACGCCTCAAGTGATGCTGGGAATGAGTATGGCCATCACCATCAGAGCTATCAT 
AGCCATAACTCCACTGAGTTGGATGGTGAGTCATCTACCTTTCTGTGGCTCCAATGTGGTT 

GTCCACrcCrACTGTGAGCACATAGCTTTGGCCAGGTTAGCATGTGCTGACCCCGTGCCCA
GCAGTCTCTACAGTCTGATTGGTTCCTCTCTTATGGTGGGCTCTGATGTGGCCTTCATTGCT 
GCCTCCTATATCTTAATTCTCAAGGCAGTATTTGGTCTCTCCTCAAAGACTGCTCAGTTGAA 
AGCATTAAGCACATGTGGCTCCCATGTGGGGGTTATGGCTTTGTACTATCTACCTGGGATG 
GCATCCATCTATGCGGCCTGGTTGGGGCAGGATGTAGTGCCCTTGCACACCCAAGTCCTGC 

TAGCTGACCTGTACGTGATCATCCCAGCCACCTTAAATCCCATCATCTATGGCATGAGGAC
CAAACAACTGCGGGAGAGAATATGGAGTTATCTGATGCATGTCCTCTTTGACCATTCCAAC 
CTGGGTTCATGA (SEQ ID NO: 208) 






AOLFR114 sequences: 

MERINHTSSVSEFILLGLSSRPEDQKTLFVLFLrVYLVTITGNLLIIIAI^^

LTDICFTTSVWKMLMNFI^EKKTISYAGCLTQMYFLYALGNSDSCIXAVMAFDRYVAVCDPF 
HYVTTMSHHHCVLLVAFSCSFPHLHSLLHTLLLNRLTFCDSNVIHHFLCDI^ 
ErVQMTEAPIVLVTRFLCIAFSYIRILTTVIJCIPSTSGKRKAFSTCGFYLTV^ 
PSTYAVIODHVATrVYTVI^SMLhJPFr/SLPJ^LKQGLRKLMSKRS (SEQ ID NO: 209) 
AOLFR109 sequences:

MLIU^GSIVTEFILVGFQQSSTSTRALLFALFLALYSLTMAMNGLIIFITSWTDPKLNSPMYFFLG 
HLSLLDVCFITTTIPQMLIHLVVRDH1VSFVCCMTQMYFVFCVGVAECILLAFMAYDRYVAICY 
PLNYVPIISQKVCVPO.VGTAWFFGLINGIFLEYISFREPFRRDNHIESFFCEAPIVIGLSCGDPQFSL
WAIFADAJVVIIJSPMVLTVTSY\^ILATILSKASSSGRGKTFSTCASHLT\A/TFLYTSAMFSYMN 
PHSTHGPDKX>KPFSLLYTUTPMCNPIIYSFRNKEIKEAMVRALGRTRLAQPQSV (SEQ ID NO: 201)

ATGCTAAGGAATGGCAGCATAGTGACGGAATTTATCCTCGTGGGCTTTCAGCAGAGCTCCA
CTTCCACACGAGCATTGCTCTTTGCCCTCTTCTTGGCCCTCTACAGCCTCACCATGGCCATG 
AATGGCCTCATCATCTTTATCACCTCCTGGACAGACCCCAAGCTCAACAGCCCCATGTACT 
TCTrCCTCGGCCATCTGTCTCTGCTGGATGTCTGCTTCATCACCACTACCATCCCACAGATG 
TTGATCCACCTCGTGGTCAGGGACCACATTGTCTCCTTTGTATGTTGCATGACCCAGATGT 

ACTTTGTCTTCTGTGTTGGTGTGGCCGAGTGCATCCTCTTGGCTTTCATGGCCTATGACCGT
TATGTTGCTATCTGCTACCCACTTAACTATGTCCCGATCATAAGCCAGAAGGTCTGTGTCA 
GGCTrGTGGGAACTGCCTGGTTCTTTGGGCTGATCAATGGCATCTTTCTCGAGTATATTTC 
ATTCCGAGAGCCCTTCCGCAGAGACAACCACATAGAAAGCTTCTTCTGTGAGGCCCCCATA 
GTGATTGGCCTCTCTTGTGGGGACCCTCAGTTTAGTCTGTGGGCAATCTTTGCCGATGCCA 

TCGTGGTAATTCTCAGCCCCATGGTGCTCACTGTCACTTCCTATGTGCACATCCTGGCCACC
ATCCTCAGCAAAGCCTCCTCCTCAGGTCGGGGGAAGACTTTCTCTACTTGTGCCTCTCACC 
TGACTGTGGTCATCTTTCTCTACACTTCAGCTATGTTCTCTTACATGAACCCCCACAGCACA 
CATGGGCCTGACAAAGACAAACCTTTCTCCCTCCTGTACACCATCATTACCCCCATGTGCA 
ACCCCATCATTTATAGTTTCCGCAACAAGGAAATTAAGGAGGCCATGGTGAGGGCACTTG 

GAAGAACCAGGCTGGCCCAGCCACAGTCTGTCTAG (SEQ ID NO: 202)

AOLFR110 sequences: 

MKIANNTVVTEFILLGLTQSQDIQLLVFVLILlFYLIILPGNFLinTmSDPGLTAPLY^^ 
DASYSFIVAPRNILVT)FLSEKKVISYRGCITQLFFLHFLGGGEGLLLVVMAFDRYIAICRPLHCST 
VMWRACYAMMIALWLGGFVHSDQVVLILRLPFCGPNQLDNFFCDWQVIKIACTDMFVVEL
LMVFNSGLMTLLCFLGLLASYAVILCHVRRAASEGKNKAMSTCTTRVIIILLMFGPAIFIYMCPF 
RALP ADKM V SLFH WIFPLMOTMIYTLBNQE\nCTSMKRLLSRHVVCQ VDFIIRN (SEQ ID NO: 203)

ATGAAGATAGCAAACAACACAGTAGTGACAGAATTTATCCTCCTTGGTCTGACTCAGTCTC
AAGATATTCAGCTCTTGGTCTTTGTGCTGATCTTAATTTTCTACCTTATCATCCTCCCTGGA 
AATTTTCTCATTATTTTCACCATAAGGTCAGACCCTGGGCTCACAGCCCCCCTCTATTTATT 
TCTGGGCAACTTGGCCrTCCTGGATGCATCCTACTCCTTCATTGTGGCTCCCAGGATGTTGG 
TGGACTTCCTCrCTGAGAAAAAGGTAATCTCCTACAGAGGCTGCATCACTCAGCTCTTTTT 

CTTGCACTTCCTTGGAGGAGGGGAGGGATTACTCCITGTTGTGATGGCCTTTGACCGCTAC
ATCGCCATCTGCCGGCCTCTGCACTGTTCAACTGTCATGAACCCTAGAGCCTGCTATGCAA 
TGATGTTGGCTCTGTGGCTTGGGGGTTTTGTCCACTCCATTATCCAGGTGGTCCTCATCCTC 
CGCTTGCCTITTTGTGGCCCAAACCAGCTGGACAACTTCTTCTGTGATGTCCGACAGGTCA 
TCAAGCTGGCTTGCACCGACATGTITGTGGTGGAGCTTCTAATGGTCTTCAACAGTGGCCT 

GATGACACTCCTGTGCTTTCTGGGGCTTCTGGCTTCCTATGCAGTCATCCTCTGCCATGTTC
GTAGGGCAGCTTCTGAAGGGAAGAACAAGGCCATGTCCACGTGCACCACTCGTGTCATTA 
rrATACTTCTTATGTrrGGACCTGCTATCTTCATCTACATGTGCCCTTTCAGGGCCTTACCA 
GCTGACAAGATGGTTTCTCTCTTTCACACAGTGATCTTTCCATTGATGAATCCTATGATTTA 
TACCCTTCGCAACCAGGAAGTGAAAACTTCCATGAAGAGGTTATTGAGTCGACATGTAGTC 

TGTCAAGTGGATTTTATAATAAGAAACTGA (SEQ ID NO: 204)
AOLFR111 sequences: 

MCYIYLffK^WTLIFYFSLLLFLQITPAIMANLTIVTEFILMGFSTNKNMCn-HSILFLLrVT^CALM 
GNVLIIMITT1JDHHLHTPVYFFLKNLSFLDLCLISVTAPKSIANSLIHNNSISFLGCVSQVFLLLSS 
ASAELLLLT WSFDRYTAICHPLimDVIMDRSTCVQRATVSWLYGGLIAVMHTAGTFSLSYCG
SNMVHQFFCDIPQLLAISCSENLIREIALILINVVL^ 
GAAACTITCAGGCTCAGGTGAGAATACCAACAGGGCCATGTCCACCTGCTATTCCCACATT

ACCATTGTGGTGCTAATGTTTGGGCCATCCATCTACATTTATGCTCGCCCATTTGACTCGTT 

TTCCCTAGATAAAGTGGTGTCTGTGTTCAATACTTTAATATTCCCTTTACGTAATCCCATTA 

TTTACACATTGAGAAACAAGGAAGTAAAGGCAGCCATGAGGAAGTTGGTCACCAAATATA 
TTTTGTGTAAAGAGAAGTGA (SEQ ID NO: 196) 



AOLFR107 sequences: 

MELWM^TLGSGFILVGILNDSGSPELLCATITTLYLLALISNGLLLLAITMEA 
LSLMDLLFTSWTPKALADFLRJ^NTISFGGCALQMFLALTMGGAEDLLLAFMAYDRYVAICH 
PLTYMTLMSSP^CWLMVATSWII^SI^ALIYTVYTMHYPFCRAQEIRHLLCEIPHLLKVACA^ 
TSRYEUvfVYVMGVTFLIPSLAAILASYTQILLTVL^ 

ATFMYVLPSSFHSTRQDhmSVFVTIVTPAIJ^LIYSLRNKEVMRA 
(SEQ ID NO: 197) 

ATGGAGCTCTGGAACTTCACCTTGGGAAGTGGCTTCATTTTGGTGGGGATTCTGAATGACA 

GTGGGTCrCCTGAACTGCTCTGTGCTACAATTACAATCCTATACTTGTTGGCCCTGATCAG 

CAATGGCCTACTGCTCCTGGCTATCACCATGGAAGCCCGGCTCCACATGCCCATGTACCTC 

CTGCTTGGGCAGCTCTCTCTCATGGACCTCCTGTTCACATCTGTTGTCACTCCCAAGGCCCT 

TGCGGACTTTCTGCGCAGAGAAAACACCATCTCCTTTGGAGGCTGTGCCCTTCAGATGTTC 

CTGGCACTGACAATGGGTGGTGCTGAGGACCTCCTACTGGCCTTCATGGCCTATGACAGGT 

ATGTGGCCATTTGTCATCCTCTGACATACATGACCCTCATGAGCTCAAGAGCCTGCTGGCT 

CATGGTGGCCACGTCCTGGATCCTGGCATCCCTAAGTGCCCTAATATATACCGTGTATACC 

ATGCACTATCCCTTCTGCAGGGCCCAGGAGATCAGGCATCTTCTCTGTGAGATCCCACACT 

TGCTGAAGGTGGCCTGTGCTGATACCTCCAGATATGAGCTCATGGTATATGTGATGGGTGT 

GACCrTCCTGATTCCCTCTCTTGCTGCTATACTGGCCTCCTATACACAAATTCTACTCACTG 

TGCTCCATATGCCATCAAATGAGGGGAGGAAGAAAGCCCTTGTCACCTGCTCTTCCCACCT 

GACTGTGGTTGGGATGTTCTATGGAGCTGCCACATTCATGTATGTCTTGCCCAGTTCCITCC 

ACAGCACCAGACAAGACAACATCATCTCTGTTTTCTACACAATTGTCACTCCAGCCCTGAA 

TCCACTCATCTACAGCCTGAGGAATAAGGAGGTCATGCGGGCCTTGAGGAGGGTCCTGGG 

AAAATACATGCTGCCAGCACACTCCACGCTCTAG (SEQ ID NO: 198) 

AOLFR108 sequences: 

MCSI^LCQTGKQAKISMGEENQTFA'SKFIFLGI^QDLQTQILIJ^Lir^LTVLGNQLinLIFLD 

SRIJITPMYFFLRNLSFADLCFSTSIVPQVLVHFLVKRKTISFYGCMTQIIVFLLVGCTECALLAV 

MSYDRYVAVOO>LYYSTIMTQRVCLWLSFRSWASGALVSLVDTSFTFHLPYWGQNnNHYFCE 

PPALLKLASroTYSTEMAIFSMGWILl^VSLILGSYWMlSTVIQMQSGEGRLKAFSTCGSHLI 

VVVLFYGSGIFTYMRPNSKTTKJELDKMISVFYTAVTPMLNPirySLRNKDVKG 

FSHRQ (SEQ ID NO: 199) 



ATGTGTTCTTTTTTCTTGTGCCAAACAGGTAAACAGGCAAAAATATCAATGGGAGAAGAAA 

ACCAAACCTTTGTGTCCAAGTTTATCTTCCTGGGTCTTTCACAGGACTTGCAGACCCAGAT 

CCTGCTATTTATCCTTTTCCrCATCATTTATCTGCTGACCGTGCTTGGAAACCAGCTCATCA 

TCATTCTCATCTTCCTGGATTCTCGCCTTCACACTCCCATGTATTTTTTTCTTAGAAATCTCT

CCTTTGCAGATCTCTGTTTCTCTACTAGCATTGTCCCTCAAGTGTTGGTTCACTTCTTGGTA

AAGAGGAAAACCATTTCTTTTTATGGGTGTATGACACAGATAATTGTCTTTCTTCTGGTTG 

GGTGTACAGAGTGTGCGCTGCTGGCAGTGATGTCCTATGACCGGTATGTGGCTGTCTGCAA 

GCCCCTGTACTACTCTACCATCATGACACAACGGGTGTGTCTCTGGCTGTCCTTCAGGTCCT 

GGGCCAGTGGGGCACTAGTGTCTTTAGTAGATACCAGCTTTACTTTCCATCTTCCCTACTG 

GGGACAGAATATAATCAATCACTACTTTTGTGAACCTCCTGCCCTCCTGAAGCTGGCTTCC 

ATAGACACTTACAGCACAGAAATGGCCATCTTTTCAATGGGCGTGGTAATCCTCCTGGCCC 

CTGTCTCCCTGATTCTTGGTTCTTATTGGAATATTATCTCCACTGTTATCCAGATGCAGTCT 

GGGGAAGGGAGACTCAAGGCTTTTTCCACCTGTGGCTCCCATCTTATTGTTGTTGTCCTCTT 

CTATGGGTCAGGAATATTCACCTACATGCGACCAAACTCCAAGACTACAAAAGAACTGGA 

TAAAATGATATCTGTGTTCTATACAGCGGTGACTCCAATGTTGAACCCCATAATTTATAGC 

TTGAGGAACAAAGATGTCAAAGGGGCTCTCAGGAAACTAGTTGGGAGAAAGTGCTTCTCT 
CATAGGCAGTGA (SEQ ID NO: 200) 
CAGTGCTGCTGGAGCATGGGGCAGCTTTATCTTACACACGCTGTGCTGCTCAGTTCTTTCT

GTTCACCTTCITrGGTTCCATCGACTGCTACCTCTTGGCCCTCATGGCCTATGACCGCTACT 

TGGCTGTGTGCCAGCCCCTGCTTTATGTCACCATCCTGACACAGCAGGCCCGCTTGAGTCT 

TGTGGCTGGGGCTTACGTTGCTGGTCTCATCAGTGCCTTGGTGCGGACAGTCTCAGCCTTC 

ACTCTCTCCTTCTGTGGAACCAGTGAGATTGACTTTATTTTCTGTGACCTCCCTCCTCTGTT 

AAAGTTGACCTGTGGGGAGAGCTACACTCAAGAAGTGCTGATTATTATGTTTGCCATTTTT 

GTCATCCCTGCTTCCATGGTGGTGATCTTGGTGTCCTACCTGTTTATCATCGTGGCCATCAT 

GGGGATCCCTGCTGGAAGCCAGGCCAAGACCTTCTCCACCTGCACCTCCCACCTCACTGCT 

GTGTCACTCTTCTTTGGTACCCTCATCTTCATGTACTTGAGAGGTAACTCAGATCAGTCTTC 

GGAGAAGAATCGGGTAGTGTCTGTGCTTTACACAGAGGTCATCCCCATGTTGAATCCCCTC 

ATCTACAGCCTGAGGAACAAGGAAGTGAAGGAGGCCCTGAGAAAAATTCTCAATAGAGCC 

AAGTTGTCCTAA (SEQ ID NO: 192) 



AOLFR105 sequences: 

MOGLNHTSVSEFILVGFSAFPHLQLMLFLLFLLMYLFTLLGNLLIMATVWSERSLHMPMYLFLC
ALSITElLYTVAnPRMLADLLSTQRSIAFLACASQMFFSFSFGFTHSFLLTVMGYDRYVAICHPL 
RYNVLMSLRGCTOIVGCSWAGGLVMGMVVTSAIFHLAFCGHKEIHHFFCHVPPLLKLACGDD 
VLVVAKGVGLVC1TALLGCFLLILLSYAFIVAAILK1PSAEGRNKAFSTCASHLTVVVVHYGFAS 
VIYLKPKGPQSPEGDTLMGITYTVLTPFLSPI1FSLRNKELKVA^1KKTCFTKLFPQNC (SEQ ID NO: 193)

ATGCAGGGGCTAAACCACACCTCCGTGTCTGAATTCATCCTCGTTGGCTTCTCTGCCTTCCC 
CCACCTCCAGCTGATGCTCITCCTGCTGTTCCTGCTGATGTACCTGTTCACGCTGCTGGGCA 
ACCTGCTCATCATGGCCACTGTCTGGAGCGAGCGCAGCCTCCACATGCCCATGTACCTCTT 
CCTGTGTGCCCTCTCCATCACCGAGATCCTCTACACCGTGGCCATCATCCCGCGCATGCTG
GCCGACCTGCTGTCCACCCAGCGCTCCATCGCCTTCCTGGCCTGTGCCAGTCAGATGTTCTT 
CTCCTTCAGCnTCGGCTTCACCCACTCCrTCCTGCTCACTGTCATGGGCTACGACCGCTACG 



TC 

CACCTCGCOTCTGTGG^

GAAGTTGGCCTGTGGAGATGATGTGCTGGTGGTGGCCAAAGGCGTGGGCJrTGGTGTGTAT 

CACGGCCCTGCTGGGCTGTTTTCTCCTCATCCTCCTCTCCTATGCCTTCATCGTGGCCGCCA 

TCTTGAAGATCCCTTCTGCTGAAGGTCGGAACAAGGCCTTCTCCACCTGTGCCTCTCACCT 

CACTGTGGTGGTCGTGCACTATGGCTTTGCCTCCGTCATTTACCTGAAGCCCAAAGGTCCC 

CAGTCTCCGGAAGGAGACACCTTGATGGGCATCACCTACACGGTCCTCACACCCTTCCTCA

GCCCCATCATCTTCAGCCTCAGGAACAAGGAGCTGAAGGTCGCCATGAAGAAGACTTGCTT 

CACCAAACTCTTTCCACAGAACTGCTGA (SEQ ID NO: 194) 



AOLFR106 sequences: 

METANYTKWEFVLTGLSQTPEVQLVL^ 

FLDIWYSSITAPEMLHDFFVERKIISFDGCIAQLFFLHFAGASEMFLLTVMAFDLYTAICRPLHYA 
TIMNOPXCCILVALSWRGGFfflSnQVAirvTU.PFCGPNELDSWCDITQVVPaACANTFPEELVM 

ICSSGLISWCXIALLMSYAFLLALFKKESGSGENTNRAMST^ 

SFSLDKWSVTNTLIFPLRNPITYTLRNKEVKAAMRKLVTKYILCKEK (SEQ ID NO: 195) 



ATGGAAACTGCAAATTACACCAAGGTGACAGAATTTGTTCTCACTGGCCTATCCCAGACTC 
CAGAGGTCCAACTAGTCCTATTTGTTATATTTCTATCCTrCTATTTGTTCATC^ 
AATATCCTTATCATTTGCACCATCAGTCTAGACCCTCATCTGACCTCTCCTATGTATTTCCT 
GTTGGCTAATCTGGCCTTCCTTGATATTTGGTACTCTTCCATTACAGCCCCTGAAATGCrCA 

50 TAGACTTCTTTGTGGAGAGGAAGATAATTTCTTTTGATGGATGCA^ 

C^ACACTTTGCTGGGGCTrCGGAGATGTTCTTGCTCACAGTGATGGCCTTTGACCTCTACA 

CTGCTATCTGCCGACCCCTCCACTATGCTACCATCATGAATCAACGTCrc 
GTGGCTCTCTCCTGGAGGGGGGGCTTCATTCATTCTATCATACAGGTGGCTCTCATTGTTC 
GACTTCCTITCTGTGGGCCCAATGAGTTAGACAGTTACrTCTGTGACATCACACAGGTTCT 
55 CCGGATOCCTGTGCCAACACCTTCCCAGAGGAGTTAGTGATGATCTGTAGTAGTG^ 
ATCTCTGTGGTGTGTTTGATTGCTCTGTTAATGT 
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KSYSLDYDQAI^VVYSVLTPFLNPFIYSLRNKEIKEAVRRQLKRIGILA (SEQ ID NO: 187) 

ATGGACACAGGGAACTGGAGCCAGGTAGCAGAATTCATCATCTTGGGCTTCCCCCATCTCC 

AGGGTGTCCAGATTTATCTCTTCCTCTTGTTGCTTCTCATTTACCTCATGACTGTGTTGGGA

AACCTGCTGATATTCCTGGTGGTCTGCCTGGACTCCCGGCTTCACACACCCATGTACCACT 

TTGTCAGCATTCTCTCCTTCTCAGAGCTTGGCTATACAGCTGCCACCATCCCTAAGATGCTG 

GCAAACTTGCTCAGTGAGAAAAAGACCATTTCATTCTCTGGGTGTCTCCTGCAGATCTATT 

TCTTTCACTCCCTTGGAGCGACTGAGTGCTATCTCCTGACAGCTATGGCCTACGATAGGTA 

TTTAGCCATCTGCCGGCCCCTCCACTACCCAACCCTCATGACCCCAACACTTTGTGCAGAG 

ATTGCCATTGGCTGTTGGTTGGGAGGCTTGGCTGGGCCAGTAGTTGAAATTTCCTTGATTT 

CACGCCTCCCATTCTGTGGCCCCAATCGCATTCAGCACGTCTTTTGTGACTTCCCTCCTGTG 

CTGAGTTTGGCTTGCACTGATACGTCTATAAATGTCCTAGTAGATTTTGTTATAAATTCCTG 

CAAGATCCTAGCCACCTTCCTGCTGATCCTCTGCTCCTATGTGCAGATCATCTGCACAGTGC 

TCAGAATTCCCTCAGCTGCCGGCAAGAGGAAGGCCATCTCCACGTGTGCCTCCCACTTCAC 

TGTGGTTCTCATCTTCTATGGGAGCATCCTTTCCATGTATGTGCAGCTGAAGAAGAGCTAC 

TCACTGGACTATGACCAGGCCCTGGCAGTGGTCTACTCAGTGCTCACACCCTTCCTCAACC 

CCTTCATCTACAGCTTGCGCAACAAGGAGATCAAGGAGGCTGTGAGGAGGCAGCTAAAGA 
GAATTGGGATATTGGCATGA (SEQ ID NO: 188) 

AOLFR102 sequences: 

MPVGKLVFNQSEPTEFVFRAFTTATEFQVLLFLLFLLLYL^m J CGNTAIIWWCTHSTLR^T^^ 
FLShfl^FLELCYTTVWPLMI^NILGAQKPlSIAGCGAQMFFFVTLGSTDCFLlAIMAYDRYVAI 
CHPLHYTLIMTRELCTQMLGGALGLAIJPSLQLTAL^ 

DmVHQAVLYVVSILVLTlPFLLlCVSYWITCAJLSmSAEGPJU^STCSFHLTVVLLQYGCCSL 

VYLRPRSSTSEDEDSQIALVYTFVTTPLLNPLLYSLRNKDVKGALRSAIIRKAASDAN (SEQ ID NO: 189)

ATGCCTGTGGGGAAACTTGTCTTCAACCAGTCTGAGCCCACTGAGTTTGTGTTCCGTGCGT 
TOACCACAGCCACTGAATTCCAGGTTCTTCTCTTCCTTCT 

ATCCTCTGTGGCAACACAGGCATCATCTGGGTGGTGTGCACACACAGCACCCTCCGCACCC 

CGATGTATTTCTTCCTGTCCAACCTGTCTTTCCTGGAACTCTGCTACACCACCGTGGTAGTA 

CCCTTGATGCTTTCCAACATTTTGGGGGCCCAGAAGCCCATTTCGTTGGCTGGATGTGGGG 

CCCAAATGTTCTTCTrTGTCACCCTCGGCAGCACGGACTGTTrCCTCTTGGCGATCATGGCC 

TATGACCGCTATGTGGCTATCTGCCACCCGCTGCACTACACCCTCATCATGACCCGCGAGC 

TGTGCACGCAGATGCTGGGTGGGGCCCTGGGCCTGGCCCTCTTCCCCTCCCTGCAGCTCAC 

CGCCTTAATCTTCACCCTGCCCTTTTGCGGCCACCACCAGGAAATCAACCACTTCCTCTGCG 

ATGTGCCTCCCGTCCTGCGCCTGGCCTGCGCTGACATCCGCGTGCACCAGGCTGTCCTCTA 

TGTCGTGAGCATCCTCGTGCTGACCATCCCCTTCCTGCTCATCTGCGTCTCCTACGTGTTCA 

TCACCTGTGCCATCCTGAGCATCCGTTCTGCCGAGGGCCGCCGCCGGGCCTTCTCCACCTG 

CTCCTTCCACCTCACCGTGGTCCTGCTGCAGTATGGCTGCTGCAGCCTCGTGTACCTGCGTC 

CTCGGTCCAGCACCTCAGAGGATGAGGACAGCCAAATCGCGTTGGTCTACACCTTTGTCAC 

CCCCTTACTCAACCCTTTGCTTTACAGCCTTAGGAACAAGGATGTCAAAGGTGCTCTGAGG 

AGTGCCATTATCCGTAAAGCAGCCTCTGACGCCAACTGA (SEQ ID NO: 190) 

AOLFR103 sequences: 

MAEMNLTLVTEFLLIAFTEYPEWALPLFLLLLFMYLITVLGNLEMULILMDHQLHAPMYFLLSH 
LAFMDVCYSSITWQMLAVLLEHGAAL^YTRCAAQFFLFTFFGSIDCYLLALMAYDRYLAVCQ 
PLLYVTILTQQARLSLVAGAYVAGUSALVRTVSAFrLSFCGTSEIDFIFCDLPPLLKLTCGESYT 
QEVLIIMFAIFVlPASMVVlLVSYLFlIVAIMGn'AGSQAKTFSTCTSHLTAV 
NSDQSSEKNRWSVLYTEVIPMLNPLr/SLRNKEVKEALRKJLNRAKLS (SEQ ID NO: 191) 

ATGGCAGAGATGAACCTCACCTTGGTGACCGAGTTCCTCCTTATTGCATTCACTGAATATC 
CTGAATGGGCACTCCCTCTCnTCCTCTTGTTATTATTTATGTATCTCATCACCGTATTGGGG 
AACTTAGAGATGATTATTCTGATCCTCATGGATCACCAGCTCCACGCTCCAATGTATTTCCT 
TCTGAGTCACCTCGCTTTCATGGACGTCTGCTACTCATCTATCACTGTCCCCCAGATGCTGG 
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TTTATGTACCTGCGCCCCAGGTCCAGCTACTCCTTGACCCAGGACCGCACAGTTGCTGTCA 
TCTACACAGTGGTGATCCCAGTGCTGAACCCCCTCATGTACTCTTTGAGAAACAAGGATGT 
GAAGAAAGCTTTAATAAAGGTTTGGGGTAGGAAAACAATGGAATGA (SEQ ID NO: 182) 

AOLFR98 sequences:

MRGFNKTTWTQFILVGFSSLGELQLLLFVIFLLLYLTILVANVTIMAVIRFSWTLHTPMYGFLFI 
LSFSESCYTFVIIPQLLVHLLSDTKTISFMACATQLFFFLGFACTNCLLIAVMGYDRYVAICHPLR 
YTLIINKRLGLELISLSGATGFFIALVATNL1CDMRFCGP>^VNHYFCDMAPVIKLACTDTHVKE 
LALFSLSILVIMVPFLLILISYGFIVNTILKIPSAEGKKAFVTCASHLTVVFVHYGCASIIYLRPKSK 
SASDKDQLVAVTYTVVTPLLNPLVYSLRNKEVKTALKRVLGMPVATKMS (SEQ ID NO: 183)

ATGCGAGGTTTCAACAAAACCACTGTGGTTACACAGTTCATCCTGGTGGGTTTCTCCAGCC 
TGGGGGAGCTCCAGCTGCTGCTTTTTGTCATCnTTCTTCTCCTATACTTGACAATCCTGGTG 
GCCAATGTGACCATCATGGCCGTTATTCGCTTCAGCTGGACTCTCCACACTCCCATGTATG 

GCTTTCTATTCATCCTTTCATTTTCTGAGTCCTGCTACACTTTTGTCATCATCCCTCAGCTG
TGGTCCACCTGCTCTCAGACACCAAGACCATCTCCTTCATGGCCTGTGCCACCCAGCTGTT 
CTTTTTCCTTGGCITTGCTTGCACCAACTGCCrCCTCATTGCTGTGATGGGATATGATCGCT 
ATGTAGCAATTTGTCACCCTCTGAGGTACACACTCATCATAAACAAAAGGCTGGGGTTGGA 
GTTGATTTCTCTCTCAGGAGCCACAGGTTTCTITATTGCTrTGGTGGCCACCAACCTCAT^ 

GTGACATGCGTTTTTGTGGCCCCAACAGGGTTAACCACTATTTCTGTGACATGGCACCTGT
TATCAAGTTAGCCTGCACTGACACCCATGTGAAAGAGCTGGCTTTATTTAGCCTCAGCATC 
CTGGTAATTATGGTGCCTTTTCTGTTAATTCTCATATCCTATGGCTTCATAGTTAACACCAT 
CCTGAAGATCCCCTCAGCTGAGGGCAAGAAGGCCTTTGTCACCTGTGCCTCACATCTCACT 
GTGGTCTTrGTCCACTATGGCTGTGCCTCTATCATCTATCTGCGGCCCAAGTCCAAGTCTGC 

CTCAGACAAGGATCAGTTGGTGGCAGTGACCTACACAGTGGTTACTCCCTTACTTAATCCT
CTTGTCTACAGTCTGAGGAACAAAGAGGTAAAAACTGCATTGAAAAGAGTTCTTGGAATG
CCTGTGGCAACCAAGATGAGCTAA (SEQ ID NO: 184) 
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AOLFR99 sequences: 

MERVNETWREVIFLGFSSLARLQQLLFVIFIXLYL^ 

SEICYTFIT^KMLVDLI^QKKTISFLGCAIQMFSFLFLGCSHSFLI^VMGYDRYIAICNPLRYSV 
LMGHGVCMGLVAAACACGFTVAQnTSLVFHLPFySSNQLHHFFCDlAPVLKLASHHNHFSQIV 

IFMLCTLVLAIPLLLILVSYVHni>AILQ^^ 

SSSQDALISVSYTIITPLFNPMIYSLRNKEFKSALCKIVRRTISLL (SEQ ID NO: 185)



ATGGAGCGGGTCAATGAGACTGTGGTGAGAGAGGTCATCTTCCTCGGCTTCTCATCCCTGG 
CCAGGCTGCAGCAGCTGCTCTTTGTTATCTTCCTGCTCCTCTACCTGTTCACTCTGGGCACC 
AATGCAATCATCATTTCCACCATTGTCCTGGACAGGGGCCTTCATATCCCCATGTACTTCTT 
CCTrGCCATCCTCTCTTGCTCTGAGATrTGCTACACCTTCATCATTGTACCCAAGATGCTGG 
TTGACCTGCTGTCCCAGAAGAAGACCATTTCTTTCCTGGGCTGTGCCATCCAAATGTTTTCC
TrCCTCTTCCTTGGCTGCTGTCACrCCTTTCTGCTGGCAGTCATGGGTTATGATCGTTACAT 
AGCCATCTGTAACCCACTGCGCTACTCAGTGCTAATGGGACATGGGGTGTGTATGGGACTA 
GTGGCTGCTGCCTGTGCCTGTGGCTTCACTGTTGCACAGATCATCACATCCTTGGTATTTCA 
CCTGCCrTTTTATTCCTCCAATCAACTACATCACTTCTTCTGTGACATTGCTCCTGTCCT 
AGCTGGCATCTCACCATAACCACTTTAGTCAGATTGTCATCTTCATGCTCTGTACATTGGTC
CTGGCTATCCCCnTATTGTTGATCTTGGTGTCCTATGTTCACATCCTCTCTGCCATACTTCA 
GTTTCCFTCCACACTGGGTAGGTGCAAAGCTTTTTCTACCTGTGTATCTCACCTCATTATTG 
TCACTGTCCACTATGGCTGXGCCTCCTTTATCTACTTAAGGCCTCAGTCCAACTACTCCTCA 
AGCCAGGATGCTCTAATATCAGTATCCTACACTATTATAACTCCATTGTTCAACCCAATGA 
TTTATAGCTTGAGAAATAAAGAGTTCAAATCAGCTCTTTGTAAAATTGTGAGAAGAACAAT

TTCCCTGTTGTAA (SEQ ID NO: 186) 



AOLFR101 sequences: 

MDTGNWSQVAEFnLGFPHLQGVQr^FLLLLLrm^TVLG^LIFLWCLDSRLHTPMYHFVSI 
LSFSELGYTAATIPKMLANLLSEKKTISFSGCLLQIYFFHSLGATECYLLTAMAYDRY^I^L 
HYPTLMTPTLCAE1AIGCWLGGLAGPWEISLISRLPFCGPNRIQHWCDFPPVLSLACTDTSINV 
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TCGCCCACTTCAGAGACTGTCTTGTTCTGATCTCITTATCCATAGAATGATATCTTTTTCCT 

TATCATGTATTATTATCTTGCCTACTATCATAGTCATTATAGTATCTTACATGTATATTGTG 

TCCACAGTTCTAAAGATACATTCTACTGAGGGACATAAGAAGGCCTTCTCCACCTGCAGCT 

CTCACCTGGGAGTTGTGAGTGTGCTGTATGGTGCTGTCTTTTTTATGTATCTCACTCCTGAC 

AGATTTCCTGAGCTGAGTAAAGTGGCATCCTTATGTTACTCCCTAGTCACTCCCATGTTGA

ATCCTTTGATTTACTCTCTGAGGAACAAAGATGTCCAAGAGGCTCTAAAAAAATTTCTAGA 
GAAGAAAAATATTATTCTTTGA (SEQ ID NO: 178)

AOLFR96 sequences: 

1 0 MICENHTRVTEFILLGFTWO'EMQVSLF^ 

I^LLEVCFTLVMWKMLVDLVSPRKTISFVGCGTQMYFFFFFGSSECFLLSMlvIAYDRFVAJCW 

LHYSVlM>niSLCLWMAIGSWMSGWVSMLQTAWMMALPFCGPNAVDHFFCDGPPVIJCLVTV 

DTTMYEMQAIASTLLFIMFPFCLILVSYTRinTIIJIMSSATGRQKAFSTCSSHLIVVSLFyGTASL 

TYLRPKSNQSPESKKLVSLSYTVITPMLNPIIYGLRNNEVKGAVKRTITQKVLQKLDVF (SEQ ID NO: 179)

ATGATCTGTGAAAATCACACCAGAGTCACTGAATTTATTCTTCTrGGTTTTACAAACAACC 
CCGAGATGCAAGTTTCCCTCTTTATTTTTTrCCTGGCCATTTATACAGTCACrTTGTTGGGC 
AACTrTCTTATTGTCACAGTTACCAGTGTGGATCTCGCACTTCAAACACCCATGTACTTCTT 
TCTTCAAAATCTGTCACTTCTTGAAGTATGTTTCACCTTGGTTATGGTGCCAAAAATGCTTG
TAGATCTAGTGTCCCCAAGGAAAATTATCrCTTTTGTGGGCTGTGGTACCCAGATGTACTT 
CTTCTTCTTCTTTGGCAGTTCTGAATGTTTCCTTCTCTCCATGATGGCTTATGATCGCTTTGT

GGCCATCTGTAACCCTCTCCATTATTCAGTCATAATGAACAGGTCCCTATGCTTGTGGATG 

GCCATAGGCTCTTGGATGTCCGGTGTTCCTGTGTCTATGCTACAGACAGCTTGGATGATGG 

CCCTTCCTTTCTGTGGACCAAATGCCGTGGACCACTTTTTCTGTGATGGTCCCCCAGTGTTA

AAACTAGTCACAGTGGATACAACCATGTATGAAATGCAAGCACTTGCCTCCACACTCCTGT 

TTATCATGTTTCCCTTTTGTCTCATTTTGGTTTCCTACACCCGCATTATCATAACAATTCTG 

AGGATGTCCTCTGCCACTGGCCGCCAGAAGGCATTTTCTACTTGTTCCTCACACCTCATTGT 

GGTGTCCCTCTTCTACGGAACAGCCAGTCTGACCTACCTGCGGCCCAAATCAAACCAGTCC 

CCTGAGAGCAAGAAGCTAGTGTCATTGTCCTACACTGTCATCACACCTATGCTAAACCCCA 

TCATCTACGGCCTGAGGAACAATGAAGTGAAAGGGGCTGTCAAGAGGACAATCACTCAAA 
AAGTCTTACAGAAGTTAGATGTGTTTTGA (SEQ ID NO: 180)
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AOLFR97 sequences: 

35 MTOFHLQSQMPSIRLIFRRIiSLGRIKPSQSPRCSTSFMV^^ 

LGLTTDPQLQRLLFVVFLGMYTATLLGNLVMFLLIHVSATLHTPMYSLLKSLSFLDFCYSSTVV 
PQTLVNFLAKRKVISYFGCMTQMFFYAGFATSECYLIAAMAYDRYAAICNPLLYSTIMSPEVC 
ASLIVGSYSAGFLNSLIHTGCIFSLKFCGAHVVTHFFCDGPPILSLSCVDTSLCEILLFIFAGFNLLS 
CTLmiSYFLILNTILKMSSAQGRFKAFSTCASHLTAICIJFGTTUWLPJRSSYSLTODRTVA 

VTYTVVIPVLNPLMYSLRNKDVKKALIKVWGRKTME (SEQ ID NO: 181)

ATGACAGAGTTTCATCTGCAAAGCCAAATGCCCTCAATAAGACTCATCTTCAGAAGGCTGT 
CCTTAGGCAGAATTAAACCCAGTCAGAGCCCCAGGTGTTCAACCTCATTTATGGTGGTGCC 
TTCTTTCTCCATCGCAGAGCACTGGAGAAGGATGAAAGGGGCAAACCTGAGCCAAGGGAT 

GGAGTTTGAGCTCTTGGGCCTCACCACTGACCCCCAGCTCCAGAGGCTGCTCTTCGTGGTG
TTCCTGGGCATGTACACAGCCACTCTGCTGGGGAACCTGGTCATGTTCCTCCTGATCCATG 
TGAGTGCCACCCTGCACAC^CCCATGTACTCCCTCCTGAAGAGCCTCrCCTTCTTGGATTTC 
TGCTACTCCTCCACGGTTGTGCCCCAGACCCTGGTGAACTTCTTGGCCAAGAGGAAAGTGA 
TCTCTTATTTTGGCTGCATGACTCAGATGTTCTTCTATGCGGGTTTTGCCACCAGTGAGTGC 

TATCTCATCGCTGCCATGGCCTATGACCGCTATGCCGCTATTTGTAACCCCCTGCTCTACTC

AACCATCATGTCTCCTGAGGTCTGTGCCTCGCTGATTGTGGGCTCCTACAGTGCAGGATTC
CTCAATTCTCTTATCCACACTGGCTGTATCTTTAGTCTGAAATTCTGCGGTGCTCATGTCGT

CACTCACTTCTTCTGTGATGGGCCACCCATCCTGTCCTTGTCTTGTGTAGACACCTCACTGT 
GTGAGATCCTGCTCTTCATTTTTGCTGGTTTCAACCTTTTGAGCTGCACCCTCACCATCTTG 
ATCTCCTACTTCTTAATTCTCAACACCATCCTGAAAATGAGCTCGGCCCAGGGCAGGTTTA 
AGGCATTTTCCACCTGTGCATCCCACCTCACrGCCATCTGCCTCrTCITTGGCACAACACTT 
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GATTGCTGCTCCCTGGGATGCAGACAATATATTTGTCTGCCTGTGTTGTCCAGCTCTTCTTG 
TACC^GCTGTGGGGACAACAGAGTTCGCATTACTTGGAGCAATGGCTGTGGACCGTTATG 
TGGCTGTCTGTAACCCTCTGAGGTACAACATCATTATGAACAGACACACCTGCAACTTTGT 

GG^CT^GTGTCATGGGTGTTTGGGTTTCITTTTCAAATCT 
5 AGCTTACTTACTGCAAATCAAATGTGGTGAACAATTITTTrTGTGACCGAGGGCAATTGCT 

CAAACTATCCTGCAATAATACTCTITTCACGGAGTTTATCCTCrTC^ 
T^CTTrGGTrCTTTGATCCCTACAATTGTCTCCAACGCCTACATCATCTCCACCATTCTC 
AAGATCCCGTCATCCTCTGGCCGGAGGAAATCCTTCTCCACTTGTGCCTCCCACTTCACCT 
TGTTGTGATTGGCTACGGCAGCTGCTTGTTTCTCTACGTGAAACCCAAGCAAACGCAGGCA 
10 GCTGATTACAATTGGGTAGrTTCCCTGATGGTTTCAGTAGTAACTCCTTTCCTCAATCCm 
CATC^CACCCTCCGGAATGATAAAGTCATAGAGGCCCTTCGGGATGGGGTGAAACGCTGC 

TGTCAACTATTCAGGAATTAG (SEQ ID NO: 174)




ATGGAGACGTGGGTGAACCAGTCCTACACAGATGGCTTCTTCCTCITAGGCATC^CTCCC 
ACAGTAC^TGCTGACCTTGTCCTCrrTCTCCGTGGTTATGGCGGTCT^ 

GGGAATGTCCTCCTCATCITCCTCATCTACATGGACCCTCACCnTCACACCCCCATGTACTT 

GCAGCCAACTTCCTGTCTGGCAGGAAGTCCATCTCCTTTGTGGGCTGTGGCATACAAATTG 
GCCTC^GTCTGTCTTGTGGGATCTGAGGGGCTCTTGCTGGGACTC^ 

ctatgtcg^ca™gccacccacttcactatcccatcct 
agatca^toagct^^ 

30 a^TGAATCC?^ 

to^a^ctggcctgtgtagacacatccctgtttgagaaggtgatatttg 

TC^CATGCTTCTCTTCCCATTCrCCATCATCGTGGCCTCCTATGCTCACATTCTAGG^^CT 
GTGCTGCAAA'TGCACTCTGCTCAGGCCTGGAAAAAGGCCCT 

^ CCGGGCCCCCAGCCATGACAAGGTGG 

AACCCCCTCATITAO^GCn^AGGAACAGGGAGGTGATGGGGGCACT 

GACCGCTGCAGGATCGGCAGCCAGCACTGA (SEQ ID NO: 176) 



AOLFR95 sequences: 

4n mi fiqKPRVHLYILPCASOOVSTMGDRGTSNHSEMTDFILAGFRVIU>ELHIIXFLLFLFVYAMILL 

T LIVTEGFLLAAN^ 

cis^GWSV™ 
KKNIIL (SEQ ID NO: 177)

ATGCTAGGATCCAAACCAAGAGTTCATTTGTATATTTTGCCCT 
TACCATGGGTGACAGGGGAACAAGCAATCACTCA 

otcagggtacgcccagagctccacattctcctcttcctgctatttttg™ 
50 Stcotcta^ggaatgttgggatgatgaccattattatgact^^ 

ACCAATGTATTTTTTCCTAGGCAATCTCTCCTTCATTGATCITTTCT 
ACCCAAGGCTATGATCAACTTCTGGTCTGAAAACAAGTCTATCTCCITTGCAGGCE 

GCCCAGCTCT^CTCTTTGCCCTCCTCATTGTGACTGAGGGATTTCTC 

™gaSg^aSgccatctgcaaccctctgctctactctg^ 

55 TCTGTACTCAGTEGGTGGCTGGTTC^ 
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SALVILSSI^^GSYVYIISmRJPSTQGRQKAFSTCASHlTWSIAHGSNIFVYVRPNONSSLD 
YDKVAAVLITWTPLLNPHYSIJ^KVQEVLRETVNRIMTLIQRKT (SEQ ID NO: 169) 

^GGGAAACTGGAGCACTGTGACTGAAATCACCCTAATTGCCTTCCCAGCTCTCCTGGAGA 

TTCGAATATCTCTCTTCGTGGTTCTTGTGGTAACTTACACATTAACAGCAACAGGAAA 

CACCATCATCTCCCTGATATGGATTGATCATCGCCTGCAAACTCCAATGTACTTCTTCCTCA 

GTAATTTGTCCTTTCTGGATATCITATACACCACTGTCATTACCCCAAAGrrGTTGGCCTGC 

CTCCTAGGAGAAGAGAAAACCATATCTTTTGCTGGTTGCATGATCCAAACATAT^CT^ 

TCTTTCTGGGGACGGTGGAGTTTATCCTCTTGGCGGTGATGTCCTTTGACCGCTACATGG^ 

TATCTGCGACCCACTGCACTACACGGTCATCATGAACAGCAGGGCCTGCCTrCTGCTGGTT 

CTGGGATGCTGGGTGGGAGCCTTCCTGTCTGTGTTGTTTCCAACCATTGTAGTGACAAG^GC 

TACCrTACTGTAGGAAAGAAATTAATCATTTCTTCTGTGACATTGCCCCTCTTCTTCAGGTG 

GCCTGTATAAATACTCACCTCATTGAGAAGATAAACTTTCTCCTCTCTGCCCITGTCATCCT 

GAGCTCCCTGGCATTCACTACTGGGTCCTACGTGTACATAATITCrACCATCCTGCGtATCC 

CCTCCACCCAGGGCCGTCAGAAAGCTTTTTCTACCTGTGCTTCTCACATCACTGTTGTCTCC 

ATTGCCCACGGGAGCAACATCTTTGTGTATGTGAGACCCAATCAGAACTCCTCACTGGAIT 

ATGACAAGGTGGCCGCTGTCCTCATCACAGTGGTGACCCCTCTCCTGAACCCTTTTATCTA 

CA ^ GAGGAA I? A ^^ 

CTTGATACAAAGGAAAACTTGA (SEQ ID NO: 170) 
AOLFR92 sequences: 

MRN^ITEFILLGFPVIQGLQTPLFIAIFLTYILTLAGNGLIIATVWA£PRLQn'MYFFLCNl^FLE 

iwytttvipkllgtfwart^cmsccllqaffhffvgttefliltimsfdr™cnp^ 

TSKLCLQI^LSSWVVGFTIWCQTMLLIQLPFCGNN^ 
T^^^GSLLFNMISYIYILSAILRIPSATGHQKTFSTCASH^ 

INKWSVLNTILTPLLNPFnrmNKEVKGALRKAMTCPKTGHAK (SEQ ID NO: 171)

ATGAGAAATGGCACAGTAATCACAGAATTCATCCTGCTAGGCTTTCCTGTTATCCAAGGCC 

™^ CACCTCTCmA ^ GC ^ T( ^CTCACCTACATArTAACCCTTGCAGGCAATGGG 

CTTA^ATTGCCACTGTGTGGGCrGAGCCCAGGCTACAAATTCCAATGTACTTCTrCCrHG 

^OTGTCTrTCTTAGAAATCTGGTACACCACCACAGTCATCCCCAAACrGCTAGGAACC 

T^TAGTGGCAAGAACAGTAATCTGCATGTCCTGCTGCCTGCTGCAGGCCTTCTTCCACT 
TCTTOOTXKXK^CCACCOAarTCrTQ^ 

ATCTGCAATCCCCTTCACCACCCCACCATCATGACCAGCAAACTCTGCCTGCAGCroG^^C 

TGAGCTCCTGGGTGGTGGGCTTCACCATTGTCrTTTGTCAGACGATGCTGCT 

^™ GTGGC ^ TAATG " ATCAGTCA ^ CTACTGTGA TGITGGGCCCAGI^^ 

S CCGCC J^ ATAGACACCAGCAmGGAACTCCTGGG ^^ 

T^ GA ^ TCAC ^CTTTAATATGATTTCTTATA1CTAC^^ 

ATTCCTTCAGCCACTGGCCACCAAAAGACTTTCTCTACCTGTGCCTCGCACCTGACAGT^^ , 
AGATTAATAAGGTGGTGTCTGTGCTAAATACTATCCTCACCCCCCrTCTGAATCCCrTTATT 

AOLFR93 sequences:

MLMNYSSATCFYLLGFPGSEELHmLFAIFFFFYLVTLMGNTVnm 
P^YNTIMNRHTCWW^ 

NTLFTEFILFLMAVFVLFGSLIPTrVSNAYnSTILKIPSSSGRJIKSFSTCASHFTCVVlG 
V^KQTQAADYNWWSIMVSWTPFLNPFIFTL^ 

A ^]T? A I?:^II A ^^ 

AC I ACATCATATCC1 ^ 1 ^ GCTA TATTCTTCTTTTTCTAC^ 

CAGTCATCATCATGATrGTCrGTGTGGATAAACGTCTGCAGTCCCCCATGTATTTCrTCCTC 
GGCCACCTCTCTGCCCTGGAGATCCTGGTCACAACCATAATCGTCCCCGTGATGCTTTGGG 
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CLGGAECFLLAVMSYDRYVAICHPLRYAVLMNKKVGLMMAVMSWLGASWSLOT^^ 

Sfcctrk^^S^avvklvcgditvyettvyiss 

KALRRVLRRDV1TQCIQRLQLWLPRV (SEQ ID NO: 165) 

ATGCTGGACCCCAGTATTTCCAGTCACACT(nTTATCTCCACTCTCTGTTTCCTCAGG 

gIga^ggggTc^ 

GCTOTCGATGA^CCTTACCCACCTm 
SSn^GAGTGGCAACA^ 

10 Sa5?g?a1^c^ 

CCTGAAGATGGCTACCAACTACCTATCTGGCAAGAAATCTATCT 

aScagcacttotctaittgtgtctaggtggtgctgaa^^ 

cta?Sac^gctatg^gccatctgtcatccac^^ 

SgggISg^gatggctgtcatgtcatgg™^ 

1 S ?Ar rrX^TTGATG^ 

GTTCCC^GCTG^G^TGAAG^GGTATGTGGCGACATCACTGTGTATC 

G^A^GXGATG^LcACCCAGTGCATTCAACGACT 
TAG (SEQ ID NO: 166) 

AOLFR90 sequences:
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xr-TTPTP A A TG ACAACAG AAGC ACTC AATAATTTTGC ACTTGG ATGTACCAACTTGTTAA 

tca^Sga^acca^^ 

T*A^^G^ 

Igc^ccctgct 

TGCGTGCTTCTCATCTGTCATCACCCCAAAGATGATTG^^ 

40 A^CATCTCTTTTGAAGGCTGCATC 

trrTr^GTCCT^CAGCCATGGCCTATGATCG 

^cSc^g^a^ccItcatacaaat^ 
^ I?I5ct^gcctcatggtggtcatcaacagtgg^ 

TA^GTATATACACGACCrCCATCTG^ 

I^cItota^tcccttgctcaatccittga 

so i£££^^ 

AACTTTAA (SEQ ID NO: 168) 

AOLFR91 sequences: 
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AOLFR87 sequences: 

M>O v nAQLSLGHDLGffSVLQKIILTTQILLFXMY 

LLGLTQNAEAQKiLFAVFTXIYFLTMVDNLUWTITTSPALDSPVYF^FFSFIDGCSSSTMAP 

KMIFDLLTEKKTISFSGCMTQLFVEHFFGGVEIILLVVMAYDCWAJCKPLYYLITMNROVCGL 
LVAMAWVGGFIJIAIJQMIXIVW^^ 

CMLIFSILITSYVLILCSQRKALSTCAFHITWVLFFVPCIL^ 
LNPLIYTLRNTEVKNAMKQLWSQIIWGNNLCD (SEQ ID NO: 161) 

ATGAATAACATAGCTCAACTTAGTCTTGGGTTTATAGATTTAGGGATTCCATCAGTGTTAC 

TTGTGCTATTCACAGAAAAATCAATTATCCAAATACCAAACTGGATTTCGAGCAAGTGAAC 

^^C^TTCATCTTGCTTGGCCTGACACAGAACGCAGAGGCACAGAAACrmcn 
^GCTGTGTTTACACTCATCTACTT^^ 

ATCACCACCAGCCCAGCCCTGGACTCCCCCGTGTATTTTTTTCTGTCTTT^ 

AAAACTATTTCCTTCAGTGGGTGCATGACCCAGCTCTTTGTAGAACATITCTTrGGGGGAG 

^GAGATCATTCTGCTCGTGGTGATGGCCTATGACTGCTATGTGGCCATCTGCAAG^ 

GTACTACCTGATCACAATGAACAGGCAGGTATGTGGCCTC^ 

GGGGGATTTCTTCACGCTCTGATTCAAATGCTTTTAATAGTCTGGCTGCCCTT^ 
C ^^™ ACCA ™ ATCTGTGACCmCCCT ^ 

CrcACGTCTTTGGACTCTTTGTTGCCGCCAACAGTGGGCTGATGTGTATGCTCATTTTTTCT 
ATTCTTATTACCTCTTACGTCCTAATCCTCTGCTCACAGCGGAAGGCTCTCTCTACCTGCGC 
CTrCCATATCACTGTAGTCGTCCTATTCTTTGTTCCCTGTATATTGGTGTACCrrC^ 
T GATCAC ^ CCCTA ^ GATAAA GCTGTGTCTGTGT^ 

AGCCAAATAATCTGGGGTAACAATTTGTGTGATTAG (SEQ ID NO: 162)
AOLFR88 sequences: 

PLRYAVLM>JKKVGLMMA\^SWLGASVNSLIHMAILMHITFCGPRKVYHFYCE^ 

^^^ lo ^ vg ^^ 

A ^ GGAGAAG ^ 

GCTCAGCCAGCTCTCCCTCATGGATCTGATGCATGTCTCCACAATCATCCTGAAGATGGCT 
^ A ^ AGCTATCTGGCAAGAAATCTATCTCC ^^ 

TCTATTTGTGTCTAGGTGGTGCTGAATGTTTTCTCTTAGCTGTCATGTCCTATGACCGCT^T 

GTTGCCATCTGTCATCCACTGCGCTATGCTGTGCTCATGAACAAGAA^T^ 

I GGC ™ TCATGTCATG ^^ 

^ A ™ GC 7" CTGTGGGCCTCG ^ 

I^^^totc^acatcact^^ 

CCTCCTCCTCCCCATCTTCCTGATTTCTACATCCT^^ 

G ATGCGCTCATCHXX3GAGCAAGAGAAATGCCTTTGCCACTTGTC 
G^CTCTITGGTTrGGTGC^ 

A ^ CAGAACAAAOTGG ^ CTGTG ^CTACAGCATCATTACGCCCACATTGAA1TCTCTG 
GTTATCACCCAGTGCATTCAACGACTGCAATTGTGGTTGCCCCGAGTGTAG (SEQ ID NO: 

AOLFR89 sequences: 
^PS IS SH^YLHSLFPQGLRKGT^ 

GWLTILUCmPQLHTPMYFLI^QL^LMDLMHVSTTILKMATTvrtXSGKKSISW 
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ATATATACTCTGAGGAATGACACAGTGCAGGGAGTCCTCAGGGATGTGTGGGTCAGGGTT 

c"gaggagt^gaa^^ 

GACCAOCAAGGAAGGGCTTGCTCnTCTCCACCATGTGTCT 
AG (SEQ ID NO: 156) 


AOLFR85 sequences: 

MGAKNNVTEFVLFGLFESPJiMQHTCFVWFLFIFVLT^ 

SF^yWtTIPKMIADTFVEHK^^ 

TAIMDCRKCGLLAGASWL^ 

10 g^a^gmIslasffiliisyviillnlrsqs 

TTLAADKLIILFN1VMPPLLNPL1YTLRNNDVKNAMRKLFRVKRSLGEK (SEQ ID NO: 157) 

atgggtgccaagaacaatgtgactgagtttgttttatitggcctttttgagagcagag^ 

TGCAGCATACATGCTTTGTGGTATTCTTCCTCT^ 

1 5 ctSgtcatcatcaccatcaatgctagaa^ 

^CAG^GTCTT^GCTGACATATGTTATCCATCCACTACCATACCCAAGATGATTGCTGAC 
CTTCT^GGTG^CACTOAGATCITC 

r^^TGCAGAcXcCT^ 

GTCATCTGAGGACCGGCGTAAGGCTGTCTCCACATGTGGCTCACACGTAATCACTGT 
2S ^GGTTCTCATOCCC^CCATGTTCATGTACATTCGTCCCTCCACCACCCTGGCT 

gSIa^cga^ 
AGAAGTGA (SEQ ID NO: 158)

AOLFR86 sequences:

35 £££5^^ 

KLFTW (SEQ ID NO: 159) 

CCCTGTGTATT^rVlTCCTGGCCAACCTATCCTTTA 

• -TCCTAAA^^CATTTCCTC 

GCTCAGCTCTTTGGAGCrCATTTTTTGGGAGGTGTTGAGATCATTCTGCTCACAGTG 

45 C^ATGACCGCTATGTGGCCATCTGTAAGCCCCTGCACAATACTACCATCAT^^ 

TCTCTGTGCCATGC^GTAGGGGTGGCTTGGCI^ 

CTCCTCCTGCTCCrr^ 

SAACAGTGGmAATCTGCCTGTTGA^ 
50 cS^CTC^GGTCCCACAGTGCAGATGGGAGAT^ 

r^CCACTTCATTGTTGTTGCCTTGTTCTTTGTGCCCTGTATATTTACTT 

TCTACTTTAOCTATAGACAAAAATATGGCATTATT^ 

TCCACTCATTTATACCCTGAGAAATGAAGAGGTAAAAAATGCCATGAGAAAGCTCTTTACA 



TGGTAA (SEQ ID NO: 160) 






WO 01/98526 



PCT/US01/20122 



ACAGCCGTGTGGCTGCTCTGCGCCGTCAACACGGCCATCCACACGGGGCTGATGCTGCGCT 

TGGATTTCTGTGGCCCCAATGTCATTATCCATTTCTTCTGCGAGGTCCCTCCCCTGCTGCTT 
CTCTCCTnrA ar~rrr a rrTA rr;Tr a a rv-i/rrv-DTv a t-^. a t-^-t^-,^™,-,,-.,, — 



J AO J OAACrTCCTGATGACCATCGCGTCCTATGGCTTCATCGTCTCCAGCATCCTGAA 

GGTGAAGACTGCCTGGGGGAGGCAGAAAGCCTTCTCCACCTGCTCTTCCCACCTCACCGTG 

GTGTGCATGTATTACACCGCTGTCTTCTACGCCTACATAAGCCCGGTCTCTGGCTACAGCG 

CAGGGAAGAGCAAGTTGGCTGGCCTGCTGTACACTGTGCTGAGTCCTACCCTCAACCCCCT 

CATCTATACTTTGAGAAACAAGGAGGTCAAAGCAGCCCTCAGGAAGCTTTTCCCTTTCTTC 
AGAAATTAA (SEQ ID NO: 152) 
AOLFR82 sequences: 



MQLWINVTEFILLGLTQDPFWKKJVFVIF^^ 

SDT(XSTSIAPPJVIIVDALLKICTTISFSECMIQVFSSHWGCLEffILILTAVDRYVDlCKPLHYMTn 
SQWVCGVLMAVAWVGSCVHSLVQfflALSLPFCGPNVINHCFOTLQPLLKQACSETYVVNLLL 
VSNSGAICAVSYWLIFSYVIFLHSLRhlHSAEVIKKAI^TCVSHnVVILFFGPCIFMYTCPATVFP 
MDKMIAVFYTVGTSFLNPVIYTLKNTEVKSAMRKLWSKKLITDDKR (SEQ ID NO: 153)



ATGCAACTGAATAATAATGTGACTGAGTTCATTCTGCrrTGGATTGACACAGGATCCrTTTT 
GGAAGAAAATAGTGTTTGTTATTTTTTTGCGTCTCTACTTGGGAACACTGTTGGGTAATTT 
GCTAATCATTATrAGTGTCAAGGCCAGCCAGGCACTTAAGAACCCAATGTTCTTCTTCCTT 

" cta <^atctttatctgatacttgcctctct^ 

TGCCCTTTTGAAGAAGACAACTATCTCCTTCAGCGAGTGCATGATCCAAGTCrTTTCATCC 

CATGTCTTTGGCTGCCTGGAGATCTTCATCCTCATCCTCACGGCTGITGACCGCTATGTGGA 

CATCTGTAAGCCCCTGCACTACATGACCATCATAAGCCAGTGGGTCTGTGGTGTTTTGATG 

GCrGTGGCCTGGGTGGGATCCrGTGTGCATTCTTTAGTTCAGATTTTTCrfTGCCCTGAGTTT 

GCCATTCTGTGGCCCCAATGTGATCAATCACTGTTTCTGTGACTTGCAGCCCTTGTTGAAA 

CAAGCCTGTTCAGAAACCTATGTGGTTAACCTACTCCTGGTTTCCAATAGTGGGGCCArTT 

GTGCAGTGAGTTATGTCATGCrAATATTCTCCTATGTCATCTTCTTGCATTCTCTGAGAAAC 

CACAGTGCTGAAGTGATAAAGAAAGCACTTTCCACATGTGTCTCCCACATCATTGTGGTCA 

TCnTGTTCTTTGGAC CTTGC ATATTTATGTACACATGCCCTGCAACCGTATTCCCCATGGAT 

AAGATGATAGCTGTATTTTATACAGTTGGAACATCTTTTCTCAACCCTGTGATTTACACGCT 

GAAGAATACAGAAGTGAAAAGTGCCATGAGGAAGCTTTGGAGCAAGAAATTGATCACAGA 
TGACAAAAGATAA (SEQ ID NO: 154)

AOLFR83 sequences: 

MGNWAAVTEFVLLGFSLSPJEVELLLL\O.LLPTFLLTLLG^LnSTVLSCSRLHTPMYFFL 

SILDELFTSVISPKVIAM-GSRDKTISFAGCITQCYFYFFLGTATEFLLLTVMSYDRYATICCPLRYT 

TIMRPSVCIGTVWSWVGGFLSVLFPmiSQLPFCGSNnNHFFCDSGPLLALACADTTAJELMDF 
MLSSMVHXDCIVLVAYSYTYnLTTVPJPSASGRKKA^ 

EINKIPLVLSSWTPFLWFIYTLRNDWQGVUtDVWW^ 
GRACSSPPCVYSVKLQC (SEQ ID NO: 155)

ATGGGTAACTGGACTGCAGCGGTGACrGAGTTTGTTCTGCTGGGGTTTTCCCTGAGCAGGG 

AGGTGGAGCTGCTGCTCCTGGTGCTCCTGCTGCCCACGTTCCTGCTGACTCTTCTGGGGAA 

CCTGCTCATCATCTCCACTGTGCTGTCCTGCTCCCGCCrCCACACCCCCATGTACTTCTTCT 

TGTGCAACCTCTCTATCCTGGACATCCTCTTCACCTCAGTCATCTCTCCAAAAGTGTTGGCC 

AACTTAGGATCTAGGGATAAAACCATCTCCrTTGCCGGATGTATCACCCAGTGCTATTTCT 

ACTTTTTCrTGGGCACAGTTGAGTTCCTCCTGCTGACGGTCATGTCCTATGACCGTTATGCC 

ACCATCTGCTGCCCCCTGCGGTACACCACCATCATGAGACCTTCTGTCTGCATTGGGACCG 

TrGTATTCTCTTGGGTGGGAGGCTTCCTGTCTGTGCTCTTTCCAACCATCCTCATCTCCCAG 

CTGCCCTTCTGTGGCTCCAATATCATTAACCACTTCTTCTGTGACAGTGGACCCTTGCTGGC 

CCrGGCCTGTGCAGACACCACTGCCATCGAGCTGATGGATTTTATGCTTTCTTCCATGGTC 

ATCCTCTGCTGCATAGTCCTCGTGGCCTATTCCTATACGTACATCATCTTGACCATAGTGCG 

CATTCCTTCTGCAAGTGGAAGGAAGAAGGCCTTTAATACCTGTGCTTCCCACCTGACCATA 

GTCATCATTCCTAGTGGCATCACTGTGTTTATCTATGTGACTCCCTCCCAGAAAGAATATCT 

GGAGATCAACAAGATCCCTTTGGTTCTGAGCAGTGTGGTGACTCCATTCCTCAACCCCTTT 
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CTATGCCATAACAGTGGTGGGCAACTTGGGAATGATGGCACTCATCTTCACAGACTCCCAT 
CTCCAAAGCCCAATGTATTTCTTCCTCAATGTCCTCTCGTTTCTTGATATTTGTTACTCTTCT 
GTGGTCACACCTAAGCTCTTGGTCAACTTCCTGGTCTCTGACAAGTCCATCTCTTTTGAGG 
GCTGTGTGGTCCAGCTCGCCTTCTTTGTAGTGCATGTGACAGCTGAGAGCTTCCTGCTGGC 
CTCCATGGCCTATGACCGCTTCCTAGCCATCTGTCAACCCCTCCATTATGGTTCTATCATGA
CCAGGGGGACCTGTCTCCAGCTGGTAGCTGTGTCCTATGCATTTGGTGGAGCCAACTCCGC 
TATCCAGACTGGAAATGTCTTTGCCCTGCCTTTCTGTGGGCCCAACCAGCTAACACACTAC 
TACTGTGACATACCACCCCTTCTCCACCTGGCTTGTGCCAACACAGCCACAGCAAGAGTGG 
TCCTCTATGTCTTTTCTGCTCTGGTCACCCTTCTGCCTGCTGCAGTCATTCTCACCTCCTACT 
GCTTGGTCTTGGTGGCCATTGGGAGGATGCGCTCAGTAGCAGGGAGGGAGAAGGACCTCT
CCACTTGTGCCTCCCACTTTCTGGCCATTGCCATTTTCTATGGCACTGTGGTTTrCACCT 
GTTCAGCCCCATGGATCTACTAACAATACCAATGGCCAAGTAGTGTCCGTCTTCTACACCA 
TCATAATTCCCATGCTCAATCCCTTCATCTATAGCCTCCGCAACAAGGAGGTGAAGGGCGC 
TCTGCAGAGGAAGCTTCAGGTCAACATCTTTCCCGGCTGA (SEQ ID NO: 148) 
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AOLFR80 sequences: 

MEGIMCTAKMQFFFRPFSPDPEVQMLlF\^LMMYLTSLGGNAmVIVQINHSLHTPMYFFLA 
NLAVLEIFYTSS1TPLALANLLSMGKTPVSITGCGTQMFFFVFLGGADCVLLVVMAYDRFIAICH 
PLRYP^IMSWSLCVELLVGSLVLGFLLSLPLTILIFHLPFCHNDEIYHFYCDMPAVMRLACADTR 
VHKTALYIISFIVLSIPLSLISISYVFIVVAILRIRSAEGRQQAYSTCSSHILVVLLQYGCTSFIYLSPS
SSYSPEMGRVVSVAYTFITPILNPLIYSLRNKELKDALRKALRKF (SEQ ID NO: 149)

ATGGAAGGAATAAATAAAACTGCAAAGATGCAGTTTTTCTTTCGTCCATTCTCACCTGACC 
CTGAGGTCCAGATGCTGATTTTTGTGGTCTTCCrGATGATGTATCTGACCAGCCTCGGTCG 

AAATGCTACAATTGCAGTCATTGTTCAGATCAATCATTCCCTCCACACCCCCATGTACTTTT
TCCTGGCTAATCTGGCAGTTCTAGAAATCTTCTATACATCTTCCATCACCCCATTGGCCTTG
GCAAACCTCCTTTCAATGGGCAAAACTCCTGTTTCCATCACGGGATGTGGCACCCAGATGT 
TTTTCTTTGTCTTCTTGGGTGGGGCTGATTGTGTCCTGCTGGTAGTCATGGCTTATGACCGG 
TTTATAGCGATCTGTCACCCTCTGCGATACAGGCTCATCATGAGCTGGTCCTTGTGTGTGG 

AGCTGCTGGTAGGCTCCTTGGTGCTGGGGTTCCTGTTGTCACTGCCACTCACCATTTTAATC
TTCCATCTCCCATTCTGCCACAATGATGAGATCTACCACTTCTACTGTGACATGCCTGCAGT 
CATGCGCCTGGCTTGTGCAGACACACGCGTTCACAAGACTGCTCTGTATATCATCAGCTTC 
ATCGTCCTTAGCATCCCCCTCTCATTGATCTCCATCTCCTATGTCTTCATCGTGGTAGCCAT 
TTTACGGATCCGGTCAGCAGAAGGGCGCCAGCAAGCCTACTCTACCTGCTCTTCTCACATC 

TTAGTGGTCCTCCTGCAGTATGGCTGCACCAGCTTTATATACTTGTCCCCCAGTTCCAGCTA
CTCTCCTGAGATGGGCCGGGTGGTATCTGTGGCCTACACATTTATCACTCCCATTTTAAAC 
CCCTTGATCTATAGTTTGAGGAACAAGGAACTGAAAGATGCCCTAAGGAAAGCATTGAGA 

AAATTCTAG (SEQ ID NO: 150)

AOLFR81 sequences:

MGVKNHSTWEFLLSGLTEQAELQLPLFCLFLGIYTVTWGNLSMISIIRLNRQLHTPMYYFLSS 
LSFLDFCYSSVITPKMMKLWMESHLIVPETPJ'SPRMMSNQTLVTEFILQGFSEHPEYRVFLFSCF 
LFLYSGALTGNVLITLAlTFNPGLHAPMYFFLLNLATMDnCTSSIMPKALASLVSEESSISYGGC 
MAQLYFLTWAASSELLLLTVMAYDRYAAICHPLHYSSMMSKWCSGLATAVWLLCAVNTAIH 

45 TGlllLRLDFCGPNVIIHITCEVPPLLLLSCSSTY^ 
~ UCVKTAWGRQKAFSTCSSHLTWCMYYTAWYAYISPVSGYSAGKSK1AGLLYTVLSPTLNPL 

rYTLRNKEVKAALRKLFPFFRN (SEQ ID NO: 151) 

ATGAAGCTGTGGATGGAGAGTCACCTGATAGTCCCAGAAACCCGTCCCAGCCCAAGGATG 
ATGAGTAACCAGACGTTGGTAACCGAGTTCATCCTGCAGGGCTTTTCGGAGCACCCAGAAT
ACCGGGTGTTCTTATTCAGCTGTTTCCTCTTCCTCTACTCTGGGGC 

CTCATCACCTTGGCCATCACGTTCAACCCTGGGCTCCACGCTCCTATGTACrTTTTCTTACT 
CAACTTGGCTACTATGGACATTATCTGCACCTCTTCCATCATGCCCAAGGCGCTGGCCAGT 
CTGGTGTCGGAAGAGAGCTCCATCTCCTACGGGGGCTGCATGGCCCAGCTCTATTTCCTCA 
CGTGGGCTGCATCCTCAGAGCTGCTGCTCCTCACGGTCATGGCCTATGACCGGTACGCAGC
CATCTGCCACCCGCTGCATTACAGCAGCATGATGAGCAAGGTGTTCTGCAGCGGGCTGGCC 
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